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PREFACE 


The present book is the second of several volumes which are 
intended to replace the Introduction to Theoretical Physics written 
by the same authors in 1933. By separating the material on mechanics, 
on electromagnetism, and on the quantum theory, we believe that it is 
possible to give a somewhat better rounded treatment of each of these 
fields, which will be more useful to the teacher and the student. We 
have taken advantage of the opportunity to give a considerably more 
complete treatment of the foundations of electrostatics and magneto¬ 
statics, and to introduce some of the new developments in electro¬ 
magnetic theory since 1933. At the same time, we have tried to 
preserve, and even to extend, the general unity of treatment which 
we believed so important when the earlier book was published. We 
have a conviction that the teaching of theoretical physics in a number 
of separate courses, as in mechanics, electromagnetic theory, potential 
theory, thermodynamics, and modern physics, tends to keep a student 
from seeing the unity of physics, and from appreciating the importance 
of applying principles developed for one branch of science to the prob¬ 
lems of another. 

We have developed electromagnetism from first principles, and 
have included in the appendixes enough of the necessary mathematical 
background so that the student familiar with the calculus and differ¬ 
ential equations can follow the work, without further training in 
mathematics or in mathematical physics. Nevertheless, electro¬ 
magnetic theory is a later historical development than mechanics, and 
it makes use of many mathematical methods, as in vector and tensor 
analysis, potential theory, and partial differential equations, which 
were first developed for mechanics, and find their most straightforward 
applications in mechanical problems. Although we feel that the 
present volume is designed so that it can be used by itself, a thorough 
grounding in mechanics is a very desirable prerequisite. The student 
who first familiarizes himself with the companion volume on mechanics 
will find many similarities in treatment of the two fields, which will 
enhance his understanding of both. 

Electromagnetic theory has developed in two principal directions 
since its original formulation in the middle of the last century: toward 
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the electromagnetic theory of light, treating very short wave lengths, 
and providing the mathematical foundation for the theory of optics; 
and toward the longer wave lengths encountered in electrical engineer¬ 
ing and radio. In the last few years, the practical applications have 
come at constantly shorter wave lengths, until with the recent develop¬ 
ment of microwaves the gap between the two branches of the subject 
has been practically closed. We handle problems of both types, mak¬ 
ing no distinction between the methods used for them. 

In 1933, it was almost universal practice to use the Gaussian system 
of units for handling electromagnetic problems, and Introduction to 
Theoretical Physics is written in terms of those units. Since then, the 
mks, or meter-kilogram-second, system has come into co mm on use, 
and we have made use of that system in the present volume, believing 
that its advantages over the Gaussian system are considerable. We 
give in an appendix, however, a discussion of the various systems of 
units, including a formulation of all the important equations in terms 
of the Gaussian system, so that the reader will not be at a disadvan¬ 
tage in consulting other books using the Gaussian units, and so that 
he will understand properly the relations among the various systems. 

We quote from the preface of Introduction to Theoretical Physics: 

“In a book of such wide scope, it is inevitable that many important 
subjects are treated in a cursory manner. An effort has been made to 
present enough of the groundwork of each subject so that not only is 
further work facilitated, but also the position of these subjects in a 
more general scheme of physical thought is clearly shown. In spite 
of this, however, the student will of course make much use of other 
references, and we give a list of references, by no means exhaustive, but 
suggesting a few titles in each field which a student who has mastered 
the material of this book should be able to appreciate. 

“At the end of each chapter is a set of problems. The ability to 
work problems, in our opinion, is essential to a proper understanding of 
physics, and it is hoped that these problems will provide useful prac¬ 
tice. At the same time, in many cases, the problems have been used to 
extend and amplify the discussion of the subject matter, where limita¬ 
tions of space made such discussion impossible in the text. The 
attempt has been made, though we are conscious of having fallen far 
short of succeeding in it, to carry each branch of the subject far enough 
so that definite calculations can be made with it. Thus a far surer 
mastery is attained than in a merely descriptive discussion. 

** Finally, we wish to remind the reader that the book is very defi- 
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nitely one on theoretical physics. Though at times descriptive 
material, and descriptions of experimental results, are included, it is 
in general assumed that the reader has a fair knowledge of experimental 
physics, of the grade generally covered in intermediate college courses. 
No doubt it is unfortunate, in view of the unity which we have stressed, 
to separate the theoretical side of the subject from the experimental in 
this way. This is particularly true when one remembers that the 
greatest difficulty which the student has in mastering theoretical 
physics comes in learning how to apply mathematics to a physical 
situation, how to formulate a problem mathematically, rather than 
in solving the problem when it is once formulated. We have tried 
wherever possible, in problems and text, to bridge the gap between pure 
mathematics and experimental physics. But the only satisfactory 
answer to this difficulty is a broad training in which theoretical physics 
goes side by side with experimental physics and practical laboratory 
work. The same ability to overcome obstacles, the same ingenuity 
in devising one method of procedure when another fails, the same 
physical intuition leading one to perceive the answer to a problem 
through a mass of intervening detail, the same critical judgment 
leading one to distinguish right from wrong procedures, and to 
appraise results carefully on the ground of physical plausibility, are 
required in theoretical and in experimental physics. Leaks in vacuum 
systems or in electric circuits have their counterparts in the many 
disastrous things that can happen to equations. And it is often as 
hard to devise a mathematical system to deal with a difficult problem, 
without unjustifiable approximations and impossible complications, as 
it is to design apparatus for measuring a difficult quantity or detecting 
a new effect. These things cannot be taught. They come only from 
that combination of inherent insight and faithful practice which is 
necessary to the successful physicist. But half the battle is over if the 
student approaches theoretical physics, not as a set of mysterious 
formulas, or as a dull routine to be learned, but as a collection of 
methods, of tools, of apparatus, subject to the same sort of rules as 
other physical apparatus, and yielding physical results of great impor¬ 
tance. . . . The aim has constantly been, not to teach a great collec¬ 
tion of facts, but to teach mastery of the tools by which the facts have 
been discovered and by which future discoveries will be made.” 

John C. Slater 

Nathaniel H. Frank 

Cambridge, Mass., 

July , 1947 . 
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CHAPTER I 

THE FIELD THEORY OF ELECTROMAGNETISM 

A dynamical problem has two aspects: mechanics, the determina¬ 
tion of the accelerations and hence of the motions, once the forces are 
given; and the study of the forces acting under the existing circum¬ 
stances. The basic principles of mechanics are simple. In its classical 
form, mechanics is based on Newton’s laws of motion, laws discovered 
and formulated nearly three hundred years ago. The developments 
since then have been technical, mathematical improvements in the way 
of formulating the laws and solving the resulting mathematical 
problems, rather than additions to our fundamental knowledge of 
mechanics. Only in the present century, with wave mechanics, has 
there been a change in the underlying structure of the subject. 

The study of forces, on the other hand, is difficult and complex. 
The first forces brought into mathematical formulation were gravi¬ 
tational forces, as seen in planetary motion. Next were elastic 
forces. Then followed electric and magnetic forces, which are the 
subject of this volume. Their study was mostly a product of the 
nineteenth century. During the present century, it has become clear 
that electromagnetic forces are of far wider application than was first 
supposed. It has become evident that, instead of being active only 
in electrostatic and magnetostatic experiments, and in electromagnetic 
applications such as the telegraph, dynamo, and radio, the forces 
between the nuclei and electrons of single atoms, the chemical forces 
between atoms and molecules, the forces of cohesion and elasticity 
holding solids together, are all of an electric nature. We might be 
tempted to generalize and suppose that all forces are electromagnetic, 
but this appears to be carrying things too far. The prevailing evidence 
at present indicates that the intranuclear forces, holding together the 
various fundamental particles of which the nucleus is composed, are 
not of electromagnetic origin. These forces, of enormous magnitude, 
and appearing in the phenomena of radioactivity and of nuclear 
fission, appear subject to laws somewhat analogous to the electromag¬ 
netic laws, but fundamentally different. In spite of this, the range of 
phenomena governed by electromagnetic theory is very wide, and it 
carries us rather far into the structure of matter, of electrons and 
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nuclei and atoms and molecules, if we wish to understand it completely. 
The equations underlying the theory, Maxwell’s equations, are rela¬ 
tively simple, but not nearly so simple as Newton’s laws of motion. 
Instead of stating the whole fundamental formulation of the subject 
in the first chapter, as one can when dealing with mechanics, about 
half of the present book is taken up with a complete formulation of 
Maxwell’s equations. We start with simple types of force, electro¬ 
static and magnetostatic, and gradually work up to problems of 
electromagnetic induction and related subjects, all of which are 
formulated in Maxwell’s equations. 

In the development of electromagnetic theory, there has been a 
continual and significant trend, which in a way has set the pattern 
for the development of all of theoretical physics. JLhis has been 
the trend away from the concept known as “action at a distance” 
toward the concept of field theory. The classical example of action 
at a distance is gravitation, in which simple nonrelativistic theory 
states that any two particles in the universe exert a gravitational force 
on each other, acting along the line joining them, proportional to the 
product of their masses, and inversely proportional to the square of 
the distance between them. Such a force, depending only on the 
positions of the particles, quite independent of any intervening objects, 
is simple to think about, and formed the basis of most of physical 
thought from the time of Newton, in the latter half of the seventeenth 
century, on well into the nineteenth century. The first electric and 
magnetic laws to be discovered fitted in well with the pattern. First 
among these was Coulomb’s law. Coulomb investigated the forces 
between electrically charged objects, and found that the force between 
two such objects was in the line joining them, proportional to the 
product of their charges (which could be defined by an experiment 
based on this observation), and inversely proportional to the square 
of the distance between them, in striking analogy to the law of gravi¬ 
tation. Magnets similarly fell in with the scheme. A theory of the 
forces between permanent magnets can be built up by considering 
that they contain magnetic north and south poles, and that the force 
between two poles is proportional to the product of the pole strengths, 
and inversely proportional to the square of the distance. It is true 
that single poles do not seem to exist in nature, but an ordinary magnet 
can be considered as made up of equal north and south poles in juxta¬ 
position, a combination known as a “dipole.” 

Coulomb’s studies were carried on in the latter half of the eight¬ 
eenth century. Early in the nineteenth came the discovery of the 
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magnetic effects of continuous currents. First was Oersted’s observa¬ 
tion that electric current flowing in a loop of wire exerted magnetic 
forces on permanent magnets, just as if the loop itself were a magnet. 
Then came Ampere to formulate these observations mathematically, 
showing that the magnetic force resulting from a circuit can be broken 
up into contributions from infinitesimal lengths of wire in the circuit, 
and that each of these forces falls off as the inverse square of the 
distance, a law often known by the names of Biot and Savart. These 
laws of action at a distance suggested that electromagnetism would 
develop along the lines suggested by gravitational theory. 

Michael Faraday, in the first half of the nineteenth century, was 
the first who really turned the electromagnetic theory into the lines of 
field theory. If a piece of insulator, or dielectric, is put between two 
charged objects, the force between the objects is diminished. Faraday 
was not content to regard this as merely a shielding effect, or a change 
in the force constant. He directed attention to the dielectric, and 
concluded that it became polarized, acquired charges which themselves 
contributed to the force on other charges. To describe these effects, 
he introduced the idea of lines of force, lines pointing in the direction 
of the force that would be exerted on a charge located at an arbitrary 
point of space. He gave a physical meaning to the number of lines 
per unit area, setting this quantity proportional to the magnitude of 
the force. He thought of the lines of force in a very concrete way, 
as if there were a tension exerted along them, and a pressure at right 
angles to them, and showed that such a stress system would account 
for the forces actually exerted on charges. Faraday’s fundamental 
idea, in other words, was that things of the greatest importance were 
going on in the apparently empty space between charged bodies, and 
that electromagnetism could be described by giving the laws of the 
phenomena in this space, which he called the “field.” His discovery 
of electromagnetic induction, in which electromotive force is induced 
in a circuit by the time rate of change of magnetic flux through the 
circuit, added certainty to his concepts, by pointing out the importance 
of the magnetic field and its flux. 

Faraday was not a mathematician, and his concepts of the field 
did not immediately appeal to the mathematicians, who were still 
thinking in terms of inverse-square laws. His contemporary Gauss 
furnished the first mathematical formulation of field theory. Gauss 
considered lines of force, their flux out of a region, and proved his 
famous theorem, relating this flux to the total charge within the 
region. It remained for Maxwell, however, some thirty years after 
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Faraday’s first discoveries, to find the real mathematical formulation 
of them. Maxwell accepted wholeheartedly the idea that the electric 
and magnetic fields were the fundamental entities, and considered the 
partial differential equations governing those fields. He had a back¬ 
ground of experience to work on. In addition to the work of Gauss, 
there was the formulation of gravitational theory in terms of the 
gravitational field and potential, which had been worked out at the 
end of the eighteenth century by Laplace, Poisson, and others. At 
the time, that formulation seemed more a mathematical device than 
anything else, but in the hands of Maxwell it furnished an ideal 
mathematical framework for Faraday’s ideas. The electromagnetic 
field is much more complicated than the gravitational, however, and 
Maxwell had to go far beyond Laplace, Poisson, and Gauss, introduc¬ 
ing among other things the concept of displacement current, which 
proved to be necessary to reach a mathematically consistent theory. 
Maxwell’s equations have stood the test of time since then, and still 
furnish the correct formulation of classical electromagnetic theory; 
it is only the quantum theory which has brought about a fundamental 
revision of our ideas, during the last few years. 

As soon as Maxwell formulated his equations, he was able to draw 
from them a mathematical result predicting a new phenomenon, 
which would hardly be suspected from the laws of Coulomb and of 
Faraday which were his starting points. He was able to show that an 
electromagnetic disturbance originated by one charged body would 
not be immediately observed by another, but that instead it would 
travel out as a wave, with a speed that could be predicted from elec¬ 
trical and magnetic measurements. Furthermore, the velocity so 
predicted proved to agree, within the small experimental error, with 
the speed of light. Thus at one blow he accomplished two results 
of the greatest importance in the history of physics. First, he gave a 
convincing proof of the superiority of a field theory to action at a 
distance; secondly, he tied together two great branches of physics, 
electromagnetism and optics. 

To see why action at a distance can hardly explain the propaga¬ 
tion of electromagnetic waves, consider as simple a thing as a radio 
broadcast. In the transmitting antenna, certain charges oscillate 
back and forth, depending on the signal being transmitted. Accord¬ 
ing to the field theory, these charges produce an electromagnetic wave, 
which travels out with the speed of light. The wave reaches a receiv¬ 
ing antenna an appreciable time later, and sets the charges in that 
antenna into oscillation, with results that can be detected in the 
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receiver. The forces on the charges in the receiving antenna are 
not determined at all by the instantaneous positions or velocities of 
the charges in the transmitting antenna, but by the values that they 
had at an earlier time. Any reaction back on the transmitter will 
be delayed by the time taken by the disturbance to reach the receiver, 
and then to return to the transmitter again, as in an echo. The forces 
on a particle, in other words, do not depend on the positions of other 
charges, but on what they did at past times. It is almost impossible 
to formulate this in terms of action at a distance, but easy to formulate 
if we regard the electromagnetic field as a real entity, taking energy 
from the transmitter, and carrying it with a finite velocity to the 
receiver. 

To appreciate the relations between electromagnetism and optics, 
which Maxwell demonstrated, we have to go back somewhat further 
with the development of optics. At the time of Newton and Huygens, 
there were two opposed theories of light, Newton holding a corpuscular 
theory, in which the light was a stream of infinitesimal particles, 
being bent as they passed from one medium to another on account 
of a surface force resulting from different potential energies in the 
various media; whereas Huygens believed that light was a form of 
wave motion, and was able to explain reflection and refraction on 
the basis of the propagation of spherical wavelets, traveling with 
different velocities in different media. Newton’s principal objection 
to the wave theory was his feeling that it did not explain the way in 
which obstacles cast sharp shadows. He was thinking by analogy 
with sound, which was known to be a wave motion, in which sound 
bends around obstacles. The thing he did not realize was that the 
wave lengths of light are so small, and that that entirely changes the 
behavior of shadow formation. It is curious that he did not think 
of this, for he was familiar with the phenomena of interference and 
diffraction; he made a theory of them which postulated a periodic dis¬ 
turbance along the direction of wave propagation, which he described 
as alternate fits of easy reflection and of easy transmission, and by 
measurement of interference patterns he determined the wave length 
of this periodic disturbance, in good agreement with modern measure¬ 
ments of the wave length of light. His combination of corpuscles 
with a periodic disturbance, in fact, showed extraordinary similarity 
to the present picture resulting from the quantum theory, in which we 
picture particles, or photons, traveling in accordance with a guiding 
wave field. 

The explanation of interference and diffraction from the wave 
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theory, together with the proper treatment of the casting of shadows, 
did not actually come until the first years of the nineteenth century, 
when Young and Fresnel made their discoveries in that field. Fresnel 
not only explained these phenomena, but also formulated the laws of 
reflection and refraction, giving laws, which have proved to be correct, 
for the fraction of the incident light reflected and refracted, as well 
as for the direction of the reflected and refracted beams. Those 
laws also explained the phenomenon of double refraction, by which 
certain crystals such as Iceland spar transmit light in two different 
rays, the ordinary and the extraordinary rays, traveling at different 
angles and speeds, and not satisfying the ordinary laws of refraction 
as found in an isotropic medium. The two rays show properties 
described as polarization, which proved to result from the fact that 
light is a transverse, not a longitudinal, vibration, so that two direc¬ 
tions of vibration are possible, both at right angles to the direction of 
propagation. The whole theory of these vibrations shows a close 
analogy to the transmission of transverse elastic vibrations in an 
elastic solid, with the one exception that there is no indication of an 
accompanying longitudinal vibration, such as there would be with 
an elastic solid, and such as would constitute the only mode of vibration 
for a fluid. 

The physicists of the nineteenth century were much devoted to 
mechanical models. If light acted like the transverse vibration of an 
elastic solid, they tried to visualize it as a real solid, and gave it a 
name, the ‘‘ether.” It was hard to understand its properties. In 
the first place, as we have just mentioned, a real elastic solid would 
transmit longitudinal as well as transverse waves, and no good way of 
modifying the theory was found that would eliminate the longitudinal 
waves. In the second place, the solid would have to fill all space, 
and it was obviously very hard to see how, with a very rigid solid 
filling space, it was possible for ordinary bodies to move around 
freely. Much thought was devoted to these questions. Even after 
Maxwell had shown that light was an electromagnetic disturbance, 
not a vibration of a solid, there was still much speculation about the 
nature of the ether. It is really only within the present century that 
physicists have realized that that speculation is essentially meaning¬ 
less, that the electric and magnetic fields are the fundamental entities 
concerned with optics as well as with electromagnetic forces, and that 
we do not have to endow these fields with mechanical properties 
foreign to their real nature. 

It was this background of an elastic-solid theory of light which 
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Maxwell encountered when he formulated his electromagnetic theory. 
That theory at once removed all the difficulties of the previous theories. 
It yielded only the transverse vibrations, having no solutions of the 
fundamental equations corresponding to longitudinal waves. Fres¬ 
nel’s equations for refraction and Reflection, and his explanations of 
interference and diffraction, though proposed for an elastic-solid 
theory, proved to be equally valid in the framework of electromagnetic 
theory. And the explanation of the casting of shadows, resulting 
from the small wave length shown to exist by experiments on inter¬ 
ference and diffraction, properly answered Newton’s objection to the 
wave theory. Taken together with the remarkable success of the 
theory in predicting the velocity of light from purely electrical meas¬ 
urements, all doubt about the electromagnetic nature of light almost 
immediately disappeared, and optics is now treated as a branch of 
electromagnetism, as we shall treat it in this volume. The argument 
was placed beyond question a few years later, when Hertz demon¬ 
strated the existence of electromagnetic waves of wave lengths of a 
few centimeters. This was soon followed by the use of much longer 
electromagnetic waves for radio communication, leading back in the 
last few years to the use of microwaves, of a few centimeters in length, 
for m ing one of the most perfect examples of the application of Max¬ 
well’s equations. 

The developments we have described comprise most of the classical 
part of electromagnetic theory, the part that is taken up in the present 
volume. Two more recent advances have concerned themselves with 
the quantum theory. First, and more revolutionary, has been the 
discovery of the photon, and of the fact that light has an aspect that 
must be explained in a corpuscular way, as well as a wave aspect. 
This was at first a mathematical deduction by Planck, from the theory 
of black-body radiation, at the beginning of the present century, and 
it was followed shortly by Einstein’s deduction of the law of photo¬ 
electric emission, from Planck’s quantum hypothesis. Light of rather 
short wave length, falling on a suitable metal surface, ejects electrons; 
these electrons are observed to have an energy depending only on the 
frequency of the light, being in fact proportional to the frequency, 
and being independent of the intensity of the light, which regulates 
merely the number of photoelectrons, not their energy. this sug¬ 
gested strongly that the energy of the light was carried by certain 
corpuscles, or photons, shown by Einstein to carry a quite finite 
energy hf, where h is Planck’s constant, / the frequency, and that a 
photon absorbed by an electron of the metal transferred its energy 
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to the electron, so that it was ejected as a photoelectron. Such a 
hypothesis was in direct conflict with the wave theory, which as we 
shall see predicts a continuous distribution of energy, and for a number 
of years this conflict was regarded as a great stumbling block in the 
development of the quantum theory. 

The difficulty still exists, in its way, but it is gradually being 
worked out in the framework of developing quantum electrodynamics.. 
It is becoming recognized that the corpuscular and the wave point 
of view simultaneously have their truth, and that the wave is the 
correct description of the average behavior of the photons, but does 
not predict the behavior of an individual photon, which in fact seems 
not to be predictable by any precise theory. The photons move so 
that on the average they deliver the same energy to any illuminated 
area that would be predicted by the wave theory, but the energy is 
actually delivered by the photons, in discrete amounts. At compara¬ 
tively long wave lengths, or low frequencies, the energy of the photons 
is so small that a great many are delivered by a source of ordinary 
energy, and their discreteness is not of importance. At very high 
frequencies, however, the finite size of the photons is of striking sig¬ 
nificance, as can be seen in experiments with counters and cloud 
chambers, in which evidence of a single photon (or of a single electron 
or ion) can be directly observed. 

The less striking, but probably equally important, development of 
electromagnetic theory in the last few years has been its place in the 
quantum theory of atomic and molecular structure. Bohr, in 1913, 
was able to explain the structure of a hydrogen atom, as being made 
up of a proton and an electron, acting on each other by ordinary 
electrostatic forces, but governed by quantum mechanics. Develop¬ 
ing this theory, Schrodinger in 1926 proposed his wave mechanics, 
a form of quantum mechanics suggested by the duality between 
corpuscles and waves in the theory of light. It had been suggested by 
de Broglie that there was a wave phenomenon associated with the 
motion of electrons, as there was with the motion of photons, and 
Schrodinger formulated this wave in terms of a wave equation, some¬ 
what similar to the wave equations of optics. By solving this equa¬ 
tion, we find descriptions of great accuracy of the structure of atoms, 
molecules, and solids and matter of all types. The interesting point 
is that the forces that enter into Schrodinger’s theory are essentially 
electromagnetic forces as described by classical electromagnetic 
theory; it is in the mechanics that this theory differs from purely 
classical theory. Thus we understand the very broad way in which 
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electromagnetic forces underlie a great segment of chemistry, and 
of the theory of the solid, liquid, and gaseous states. 

The applications of Schrodinger’s theory to the structure of matter 
deal almost entirely with particles traveling slowly compared with 
the velocity of light. The electromagnetic theory that has to be 
used is then essentially electrostatics, making no use of the finite 
velocity of propagation of light. When we come to apply the theory 
to particles traveling with velocities comparable with that of light, 
we meet essential difficulties, which have not yet been fully solved. 
We must expect a combination of the theory of radiation, and of the 
electromagnetic forces concerned in an atom or molecule, which will 
be consistent and complete, and this does not yet seem to exist. Of 
course, particles traveling with velocities near that of light must 
satisfy the relativistic mechanics. Einstein, in the early years of the 
century, devised his theory of relativity, to explain the dynamics of 
rapidly moving particles, and the way in which the velocity of light 
appeared the same in any frame of reference. The modifications 
that proved to be necessary were only in the dynamics, not in the 
electromagnetic part of the theory; the forces as predicted by classical 
electromagnetic theory are relativistically correct, as had been shown 
earlier by Lorentz. But, in setting up the quantum electrodynamics 
of rapidly moving particles, it is relativistic rather than Newtonian 
mechanics which must be used as a start, and it is this combination 
which is not yet without its difficulties. 

Probably part of the trouble with these theories is the fact that 
new phenomena are appearing as we go to very rapid particles of very 
high energy, which complicate the whole theory. These are the 
phenomena of the forces between the particles, neutrons and protons, 
in the nucleus. There is good evidence that these forces are not 
electromagnetic, but of another variety, produced not by the electro¬ 
magnetic field, but by a meson field. This field has been described 
mathematically by analogy with electromagnetic theory. It similarly 
has a wave aspect, but also a corpuscular aspect, the corpuscles in 
this case being mesons, rather than photons. It is thus becoming 
likely that there exist in nature a number of different levels of forces 
and particles: the electromagnetic field and the photons, the electrons 
and protons and neutrons and their associated fields of the de Broglie 
or Schrodinger type, and the mesons and their fields. And the guid¬ 
ing pattern for the development of all these theories is at present the 
electromagnetic theory in its classical form. It remains to be seen 
what final synthesis of these various theories can eventually be made. 
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We are now ready to proceed with our study of classical electro¬ 
magnetic theory, which alone we shall take up in this volume. We 
start in the historical order, treating electrostatics, first from the basis 
of Coulomb’s law, then from the standpoint of field theory, showing 
- as Faraday did how much that helps us in the study of problems 
involving dielectrics and conductors. Then we take up in a similar 
way the magnetic field, passing on through the study of electromag¬ 
netic induction to Maxwell’s equations, and to their application to the 
electromagnetic theory of light and of other electromagnetic radiation 
fields. 

1. The Force on a Charge.—Electromagnetic theory deals with the 
forces acting on charges and currents. We find that an electric charge 
at a given point of space is acted on by two types of force: an electric 
force, independent of its velocity, and a magnetic force, proportional 
to its velocity (that is, to the current carried by the charge), and at 
right angles to its velocity. We find that different charges at the 
same point of space are acted on by different amounts of force, and we 
arbitrarily define the strength of the charge as being proportional to 
the magnitude of force acting on it in a given field. We shall see 
later how to define the unit of charge, the coulomb. We can define 
two vectors at every point of space, E the electric intensity, B the 
magnetic induction, such that the force F on a charge of q coulombs 
moving with velocity v is given by the vector equation 

F = q(E + v X B). (1.1) 

Here v X B is the vector product of v and B, a vector at right angles 
to both, whose magnitude is the product of the magnitudes of v and B 
times the sine of the angle between. The reader unfamiliar with this 
and other aspects of vector notation will find vector methods discussed 
in Appendix I. To measure E and B we need only measure the force 
on a moving charge at the point in question. The second term is the 
ordinary motor rule, that the force on a current element is proportional 
to the current, and the component of magnetic field at right angles to 
it, and is at right angles to each. In the inks (meter-kilogram-seeond) 
system of units, which we shall use, the force will be given by (1.1) in 
newtons (1 newton = 10 5 dynes), if the charge q is in coulombs, if E 
is in volts per meter, v in meters per second, and B in webers per 
square meter (1 weber/sq m = 10 4 gausses). The inks systenq. of 
units is discussed in Appendix II, in which a discussion is also given 
of various other commonly used sets of units, and of the form that 
familiar equations take in these other units. We shall see later how 
to define the volt and the weber, as well as the coulomb. 
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As far as charge is concerned, with our knowledge that all matter 
is composed of electrons, protons, neutrons, and other such elementary 
particles, all of which prove experimentally to carry charge equal to an 
integral multiple of the electronic charge e, given by 

e = 1.60 X 10~ 19 coulomb, (1.2) 

we see that the determination of the charge on a body really resolves 
itself into a counting of the unbalanced elementary charges on it. 
This fact, that all charges are integral multiples of a fundamental 
unit, is still one of the unexplained puzzles of fundamental physics. 
It does not in any way contradict electromagnetic theory, but it is 
not predicted by it, and until we have a more fundamental theory 
that explains it, we shall not feel that we really understand electro¬ 
magnetic phenomena thoroughly. Presumably its explanation will 
not come until we understand quantum theory more thoroughly than 
we do at present. 

2. The Field of a Distribution of Static Point Charges.—For a 

number of chapters we shall be dealing with the forces on charges at 
rest, and shall consider only the electric field E, which will be assumed 
to be independent of time. This is the branch of the subject known as 
.“electrostatics. ’’ It is now found that the field E can be determined, 
once the distribution of charge is known. This fact, together with 
(1.1) and Newton’s laws of motion, furnishes a complete system of 
equations: knowing where the charge is, we find E; knowing E, we 
find the force F; knowing the force, we find the acceleration of the 
particles bearing the charge, and hence their motions. Of course, 
our special case of electrostatics must be that in which the total 
force, which may include nonelectrostatic as well as electrostatic 
forces, acting on a body, is zero, so that the body can stay at rest, and 
we can be dealing with a static problem. 

The law giving the field E arising from a distribution of charge 
is very simple. We may consider the charge to be made up of a 
great many small, or point, charges (which on an atomic scale could 
be electrons and protons). We may compute the field resulting from 
each of these point charges. Then the first experimental law is that 
the total field is the vector sum of the fields arising from the various 
point charges. Furthermore, the field E of a static point charge q 
proves to be in the direction pointing away from q, and is equal in 
magnitude to g/47reor 2 , where 

e 0 = 8.85 X 10“ 12 farad/m, (2.1) 

and where r is the distance from the charge, measured in meters. 
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This may for the present be considered an experimental law, with 
(2.1) as an experimentally determined coefficient, determined to 
give the result for E in volts per meter, if q is in coulombs. In Appen¬ 
dix II we shall see why it is that in terms of the definition of the volt 
and the coulomb we have this particular numerical constant. The 
factor 4tt is included in the formula at this point, because it is more 
convenient to do so for later applications, as we shall see presently. 
We shall also understand later the units, farads per meter, which we 
have assigned to eo in (2.1). 

As a result of our simple law giving the field arising from a point 
charge, we can find easily the force between two charges q and q', 
. at a distance r apart. The force is clearly in the direction joining 
them, and is equal in magnitude to 


F - 


m' 

4jr€or 2 


( 2 . 2 ) 


It is a repulsion if q and q' have the same sign, an attraction if they 
have opposite sign. Equation (2.2) is Coulomb’s law, in the form 
that it takes in the mks system; we give corresponding expressions in 
other systems of units in Appendix II. In Eq. (2.2) we have an 
illustration of the fact that problems in electrostatics can be handled 
by the ideas of action at a distance, since Coulomb’s law is similar to 

the law of gravitation. 

It is such a simple matter to find 
the field of a point charge that we 
can easily sum such fields vectorially, 
to find the field of an arbitrary charge 
distribution, provided that we know 
P where the charges are located. 
Simple examples of the field of distri- 
Fig. i.—F ield Of a charged line. butions of particles are the fields of a 

uniformly charged line and plane. 
Let a line carry a constant charge o- per unit length. The contribution 
of the charge in length dx to the field atP, in Fig. 1, is along the direc¬ 
tion of R; its component along r, which alone integrates to something 
different from zero, is (<r dx/4ireoR 2 )(r/R) = err dx/AmoR 3 . The result¬ 
ant field is 



or f °° dx _ cr 

4it€o J — ao Or 2 r 2 )^ %Treor 


Similarly let a surface carry a constant charge of a- per unit area. The 
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contribution of the charge in the ring of radius between x and x + dx 
in the plane, shown in Fig. 2, to the component of field along the 
normal, at P, is (2tt(tx dx/4:TT€oR 2 )(r/R) = <rrx dx/2eoR z } and the 
resultant field is 

or f x x dx _ <r 

2To Jo O 2 + r 2 )^ “ 2eo 

a constant independent of the distance from the plane. It should be 
noted, however, that the field is directed away from the plane on each 
side of the plane (provided cr is positive), so that there is a discon* 



tinuity of o/e Q in the normal component of E in passing through the 
plane. This problem of the field resulting from a distribution of 
charge on a plane illustrates the reason why the factor 4ir was included 
in the statement of Coulomb’s law, as in (2.2). The reason is that 
this factor then drops out in problems such as the field of a plane 
charge. If we had not included the 4 tt at first, it would have appeared 
at this po in t. Since more problems are similar to that of the field 
of a plane than to the field of a point charge, it is more convenient to 

handle the factor Air as we have done. 

The two examples that have just been presented show how simple 
it is to find fields by direct integration in some cases. We shall now 
go on to a more involved, but more powerful, method of finding the 
field of a distribution of charge, using the method of the potential. 
This method has two advantages over direct integration. First, it is ^ 
more powerful mathematically; there are many problems to which it 
will provide solutions, and in which the direct integration proves to be 
too difficult to carry out. Secondly, there is a whole class of problems 
in which direct integration is of no use, for we do not know the charge 
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distribution to start with. This class of problems includes those 
involving conductors, or dielectrics. With such bodies, as we shall 
see, the presence of certain charges in the neighborhood induces other 
charges on the conductors or dielectrics, or, as we say, polarizes them. 
Part of our problem is to find the nature and location of this polarized 
charge. .The method of the potential can handle this problem, whereas 
direct integration cannot. As a first step in taking up the potential, 
we consider the work done in moving a point charge from one place to 
another in the field, and the potential-energy function resulting from 
this work done. 

3. The Potential.—The work that we must do in carrying a unit 
charge from one point to another in the field of a point charge, balanc¬ 
ing the force exerted by the point charge, is independent of the path. 
To prove this, we note that the work done in going from one point to 
an adjacent point is the product of the force ~g/4xeor 2 which we exert 
to balance the electrostatic force, by the component of displacement 
dr in the direction of force, or is —E • ds, where E is the electric field, 
ds is the vector displacement, and where the scalar product E • ds 
equals the product of the magnitudes of E and ds, and of the cosine 
of the angle between them, as is discussed in Appendix T. Thus, 
integrating, 

Work = — f E • ds = / — ~ 2 dr = ( —-(3.1) 

J i Jn 4'7T€ 0 r 2 4 t 6 0 \r 2 n/ 


This work may then be written as the difference of a potential energy 
at the end points: if we write 


<p 


Q 

47T€o r 


(3.2) 


as the potential function at a distance r from a charge q , the work 
done moving a unit charge from point 1 to point 2, by (3.1), is 

Work — <p 2 — <p\. 

Then, as always with problems involving potentials, we may write the 
force as the negative gradient of the potential: 

E = — grad <p = — V<p, (3.3) 

where the vector operations are discussed in Appendix I. Further¬ 
more, using the principle of vector analysis that the curl of any gradient 
is zero, we have 

curl E = V X E = 0. (3.4) 

These relations (3.3) and (3.4) hold for the field of a point charge; by 
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addition, they hold for the field of any arbitrary number of point 
charges. 

Use of this principle gives us a simplified way of computing the 
field of a distribution of charges. We sum the potentials of the 
charges, so as to get the total potential as a function of position; 
then we take its gradient, to get the field. This allows us to sum 
the potential, a scalar, rather than the field, a vector. If charges 
are distributed continuously in space, instead of at discrete points, 
we may describe them by a charge density p, such that p dv is the 
charge located in volume element dv , so that p is in coulombs per 
cubic meter in the mks system. Then we may write the potential 
of this distribution as 



In this expression, we are finding <p as a function of x, y, and z ; the 
integration is over dx' dy' dz f = dv, the coordinates of the point where 
the density p, which is really a function of x ', y', z ', is found, r is the 
distance between these two points, given by 

r = \Z(x — z') 2 + (y — 2/') 2 4- (z — z') 2 - 

The potential is expressed in volts. It will automatically satisfy (3.4), 
from the way in which it is built up. From <p as given in (3.5), we 
then find E by (3.3). 

Surfaces <p — constant are called “ equipotential surfaces.” The 
field E is normal to the equipotential surfaces, as we can see either 
from the principle of vector analysis that the gradient of a scalar 
is normal to the surfaces on which the scalar is constant, or from 
the elementary fact that, since the work done on a charge moving 
from one point to another of the same equipotential surface must be 
zero, any displacement on that surface must be at right angles to the 
force vector. We may draw lines, called “lines of force,” everywhere 
tangent to E; they then cut the equipotentials at right angles. These 
concepts, of the sort stressed particularly by Faraday, are particularly 
useful in electrostatic problems involving conductors. In a conductor, 
charges are free to move about. Thus, if conductors are present in a 
field, we cannot predict straightforwardly where the charge is to be 
found, and hence cannot find <p by (3.5), since we do not know p. 
We do know, however, that a conductor satisfies Ohm’s law, which 
states that the current in a conductor is proportional to the electric 
field in it. If our problem is a static one, in which all charge is at rest 
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and there is no current, the field must then be zero everywhere within 
the conductor. The potential must then be constant, by (3.3). Thus 
a conductor forms an equipotential volume, its surface being an equi- 
potential surface, so that all lines of force must cut it at right angles. 

4. Electric Images. —Suppose we are given the problem of finding 
the potential, and hence the field, of a certain set of charges, q lt 
q 2} . . . , at specified points, in the presence of certain conductors 
that are maintained at definite potentials. The potential must then 



Fig. 3.—Electric image of a point change in a conducting plane. 


satisfy the following conditions: it must be equal to the required 
constant value on the surface of each conductor, and it must reduce 
to a value like (3.2) at each of the charges. These conditions can be 
shown to determine the potential uniquely. Sometimes by a simple 
device we can set up a potential, or the corresponding field, satisfying 
these conditions; we can then be sure we have solved our problem. 
Thus consider a single charge q at a distance d from a grounded 
conducting plane. There is a problem, shown in Fig. 3, which has a 
potential that reduces to the proper value where the charge is, and 
which gives a potential zero on the surface, by symmetry: it is the 
problem of two equal and opposite charges ±q, at a distance 2d apart, 
the mid-plane taking the place of the surface. The correct lines of 
force and equipotentials are then as shown in Fig. 3, the dotted exten¬ 
sions being really nonexistent. The fictitious charge —q behind the 
metallic surface is called an “electric image,” from the analogy of an 
optical image, and this method is called the “method of images.” 
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The method of images can also be used for the problem of a point 
charge and a conducting sphere. This depends on a geometrical 
theorem. In Fig. 4, let oa/oc = oc/ob. The triangles oac and obc are 



Fig. 4.—Image of a point charge in a conducting sphere. 

similar, for they have the angle aoc in common, and the sides including 
the angle are proportional. Thus 


oa _ oc _ r i 
oc ob rz 


(4.1) 


If we then place a charge q at b, and a charge — (oc/ob)q at a, the 
potential at any point of the circle will be 

1 - (?£) 1 = 1 - 1 = o 
r 2 \ob/r i r 2 r 2 ’ 

using (4.1). Thus the circle, or the sphere formed by rotating it 
around the axis ob, is an equipotential of zero potential. Hence the 
two charges, q at b and its image — ( oc/ob)q at a, give a field that is a 
solution of the problem. A somewhat more general problem can be 
solved by adding to the field of these two charges the field of an arbi¬ 
trary point charge at the center o of the circle. This charge will give a 
potential that is constant over the surface of the sphere, so that 
the field of all three charges is consistent with the assumption that the 
sphere is a conductor. By adjusting the amount of charge at the 
center o, we can solve the problem of the potential of a point charge 
q at 6, and a conducting sphere carrying any desired amount of charge. 

Problems 

1. An electron of charge —c (e = 1.60 X 10~ l “ coulomb) and mass m 
(to = 9.1 X 10” :!1 kg) moves in a magnetic field B (in gausses) in a plane at right 
angles to B. Show that it moves in a circular path of radius p, and find the value 
of the product Bp in terms of the velocity of the electron. 

2. An electron moves in a uniform electric field directed along the x axis, 
and a uniform magnetic field along the y axis. It starts from rest at the origin. 
Show that it moves in a cycloidal path, and find the drift velocity, or the velocity of 
the center of the rolling circle along the z axis. If the electric field is 10 4 volts/cm, 
and the magnetic field is 10 4 gausses, find the value of the drift velocity. 
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3. Find the angular velocity of rotation of an ion in a cyclotron. If the 
magnetic field is 15,000 gausses, what should be the wave length and frequency of 
the oscillator used to run the cyclotron, if it is accelerating protons? 

4. When the velocity of a particle approaches that of light, a force at right 
angles to the velocity produces an acceleration transverse to the velocity given by 
the equation F = mta, where m t is the transverse mass, equal to 

mp 

VT — t> 2 /c 2 

if m 0 is the rest mass, «the velocity, c the velocity of light. Discuss the cyclotron 
in the relativistic range, showing that the resonant frequency must change as the 
ions are speeded up. If protons are to be speeded up to 500 million electron volts, 
find the ratio by which the oscillator frequency must change as they accelerate from 
rest to their maximum speed. 

6. A charge e is located a distance d from an infinite conducting plane. Find 
how much work is required to remove the charge to infinite distance, against the 
attraction of its electric image. 

6. Find the potential at distance r from an infinitely long straight wire carrying 
a charge <r per unit length. 

7. Inside a hollow infinitely long grounded conducting cylinder of radius R 
is placed a thin charged wire, carrying a charge a per unit length. The wire is 
placed parallel to the axis of the cylinder, and distant by an amount d from the 
axis. Find the potential at points within the cylinder. [Hint: Use the construc¬ 
tion of Fig. 4 and of Eq. (4.1).] 

8 . Find the attraction between a charge q, and a grounded conducting sphere, 
as a function of the distance of q from the center of the sphere. 

9. Find the attraction between a charge q , and an uncharged, insulated, con¬ 
ducting sphere, as a function of the distance of q from the center of the sphere. 
(We shall prove later that, if the sphere is uncharged, the total fictitious charge 
located at points o and a in Fig. 4 must be zero.) 

10. Two parallel wires carry equal and opposite charges ±<r per unit length. 
Consider the intersections of the equipotentials with a plane normal to the wires, 
and the lines of force, which will lie in that plane. Show that the lines of force and 
equipotentials form two families of circles, orthogonal to each other. 

11. A charge q is placed inside a hollow metal sphere of radius a at a distance r 
from the center. Show that the force acting on the charge q is given by 

et _ _ q 2 ar _ 

4tt«o( a 2 - r 2 ) 2 

12. Find the force per unit length with which two long parallel metal cylinders, 
each of diameter “a and separated by a distance d between centers, attract each 
other, if they carry a charge per unit length ±<r, respectively. 



CHAPTER II 
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The simple concept of lines of force and of equipotential surfaces, 
which we introduced in the preceding chapter, becomes of real value 
when it forms the basis of an analytical treatment founded on differ¬ 
ential equations. This treatment, worked out originally by Gauss, 
Poisson, Laplace, and others, can be used either for electrostatics, to 
which we apply it, or for gravitational forces, which consist of inverse- 
square forces, as electrostatic forces do. It was actually for the 
gravitational case that much of the mathematical work of the present 
chapter was originally developed. We shall find, however, that it 
forms a very natural mathematical framework for electrostatics, and 
that it leads us into certain equations that will later underlie the 
general case of electromagnetism, not merely the static problem. We 
develop the equations in the present chapter, and solve some electro¬ 
static problems by means of the solution of partial differential equa¬ 
tions in the next chapter. 

1. Gauss’s Theorem.—Suppose we set up a closed surface S , 
enclosing a volume V within which are certain charges # 1 , g 2 , • . . • 
We now form a surface integral over S, as follows: We take an element 
da of the area of the surface. If n is unit vector along the outer normal, 
and if E is the value of the electric intensity at da, the surface integral 
is E • n da, integrated over the surface S. Gauss’s theorem is then 
the following: 


J E • n da = Y 


( 1 . 1 ) 


That is, the surface integral of the normal component of E over a 
surface S equals l/«o times the total charge included within the surface. 
Let us prove this important theorem. First we prove it for a single 
point charge; then by addition the theorem obviously holds for any 
collection of point charges. For a single charge, consider an element 
da of surface, as shown in Fig. 5. The contribution of this element 
to E*nda is \E\ da cos (E,n). But da cos (E,n) is the projection 
of da on the plane normal to E, or is r 2 dec, where dec is the solid angle 
of the cone subtended by da. Also \E\ = q/^reor 2 . Thus the contri- 
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bution of the element to the surface integral is 


AireoT 2 


r 2 do) 


= -S-da. 
4jT€q 



Fig. 5. —Construction for Gauss’s theorem. 


Integrating over S involves integrating dec over the complete solid 
angle 4x, resulting in q/e 0 , in agreement with (1.1). Thus, summing 

over an arbitrary number of 
point charges, Gauss’s theorem is 
proved. 

Examples of the application 
of Gauss’s theorem are obvious. 
First consider a positive point charge, and a sphere of radius r sur¬ 
rounding it. By symmetry, the field points out along the radius, and 
depends only on r. Thus Gauss’s theorem states that \E\ times the 
area, 4?rr 2 , equals q/e o, or \E\ = g/4ire 0 r 2 . This is trivial, but we can 
equally well use Gauss’s theorem to find the field of a uniformly 
charged line and plane, as in Chap. I, Sec. 2. For a line carrying a 
charge cr per unit length, let the surface S be the surface of a circular 
cylinder of unit length, radius r, with the line as the axis. By sym¬ 
metry, the field points radially 
outward, and depends only on r. 

Thus the field is parallel to the 
flat ends of the cylinder, which do 
not contribute to the surface in¬ 
tegral ; while over the curved face 
the integral is \E\ times the area 
2xr. Since this equals tr/eo, we 
have \E\ — <r/27T€ 0 r. For the 
charged plane, take S as the sur¬ 
face of a box, as shown in Fig. 6, 
enclosing unit area of the surface, 
the two faces parallel to the plane. 



Fig. 6.- 


-Gauss’s theorem for field of 
charged plane. 


By symmetry, E will be normal to 
Hence only these faces will con¬ 
tribute to the integral. Each has unit area; thus 2\E\ = <r/* 0 , \E\ = 
<r/2eo. Thus we verify the results of Chap. I, Sec. 2, by very simple 
methods. 

As a final example, consider the surface of a conductor carrying a 
surface charge c per unit area. We set up a surface as in Fig. 6, but 
now to the left of the surface, or inside the conductor, E is zero. Our 
volume still encloses charge <r, but now the surface integral has a con¬ 
tribution only from the right-hand face. Thus \E\ — <r/e 0 on the right. 
We note that this case can be found from the earlier one by super- 
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posing a constant field <r/2e 0 pointing to the right on our earlier solu¬ 
tion. This field cancels the field originally present to the left of the 
plane, and doubles the field originally present on the right. Such a 
constant field makes no contribution to the surface integral concerned 
in Gauss’s theorem, and in fact any constant field, or more generally 
the field of any charge distribution outside the surface, may be added 
to our original solution of the problem, without contradicting Gauss’s 
theorem. It is thus clear that Gauss’s theorem by itself is not enough 
to determine the field completely. In our first case of the charged 
plane we used the additional fact of symmetry, and in the second case 
of the charged surface of the conductor we used the fact that the field 
must be zero within the conductor. In any actual case there would 
be information sufficient to determine the field definitely. We note 
an important result common to all these possible solutions for the 
charged plane, however: the normal component of E undergoes a 
discontinuity of <r/eo in every case in going through a surface charge <r. 
Thus in the case of the charged plane the component to the right 
of the field changes from —a/ 2« 0 to cr/2e 0 , and in the case of the con¬ 
ducting surface it changes from zero to <r/e 0 . It is clear from Gauss’s 
theorem that this relation is quite general. It has a valuable applica¬ 
tion, which will be brought out in the problems: if we know the poten¬ 
tial and field at every point of space, including the neighborhood of 
conductors, we can calculate the surface-charge density on the surface 
of the conductors, since we know the normal component of field. We 
thus see, as we pointed out in the preceding chapter, that a solution 
for a potential that satisfies the correct boundary conditions on the 
surface of all conductors allows us to find the distribution of charge, 
though otherwise we should not know how the charge was distributed. 

2. Capacity of Condensers.—A condenser consists of two con¬ 
ductors carrying equal and opposite charges; its capacity C is defined 
as the charge on one of the conductors, divided by the difference of 
potential between them. In a number of simple cases we can easily 
get the capacity by use of Gauss’s theorem. Thus consider two 
parallel plates of area A , distance of separation d. The field E will be 
normal to the surfaces (if we neglect edge effects). If there is surface 
charge a on one plate, —<r on the other, there will be a field E — <r/e o 
between; and the difference of potential, or voltage V, between them, 
will be Ed — <rd/e o. The charge Q on the plates will be crA. Thus 
the capacity will be 


crA cqA 
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From this we see the physical meaning of eo: it is the capacity of a 
condenser whose area is 1 sq m, with two plates 1 m apart. 

Next consider a cylindrical condenser, consisting of two concentric 
cylinders of radii ri, r 2 (r 2 > ri), of length!/. Let the inner conductor 
carry charge —Q, the outer one Q. The field between the two, by 
symmetry, must be radially inward, and hence by Gauss’s theorem 
must be — (Q/L)/2r€or, just as if the charge were concentrated along 
the axis. The potential at distance r is then the quantity whose 
negative gradient is the field, or is [(Q/jL)/ 27T6 0 ] In r. Thus the differ¬ 
ence of potential is 



and the capacity is 

Q 2tT€oL 

~~ In r a /ri 

For a spherical condenser, consisting of two concentric spheres of 
radii ri and r 2 (r 2 > ri), carrying charges —Q and Q, respectively, the 
field must be radially inward, and hence by Gauss’s theorem must be 
— Q/4ire 0 r 2 j as if the charge were concentrated at the center. The 
potential at distance r is then — Q/47re 0 r, and the difference of potential 
between conductors is (Q/47re 0 )[(lAi) — (l/r 2 )], so that the capacity is 

sy _ 47T60 

(1/ri) ~ (1 /r t ) 

It is interesting to note that, as r 2 —> °°, the capacity stays finite; 
this value is often referred to as the capacity of a sphere. Thus a 



Fig, 7, —Capacity of parallel cylinder. 


sphere 1 m in radius has a capacity 4xeo with respect to an infinitely 
. distant sphere. 

As a final example of capacity we take a somewhat more compli¬ 
cated case, two parallel cylinders, each of radius a , length L , at dis¬ 
tance D apart between their centers, where we assume the length 
to be so great that we can neglect end effects. We shall first show 
that equal and opposite charges ± Q distributed uniformly over the two 
lines 1 and 2 in Fig. 7, where 8/a = a/d, will result in the two cylin- 
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drical conductors being equipotentials. To prove this, we use the 
geometrical theorem of Eq. (4.1), Chap. I, though for quite a different 
purpose. From it we see that 8/a = a/d = ri/r 2 . A line charge +Q 
at 1, and —Q at 2, will have a potential equal to 


Q/L 

2ireo 


(In r i 


In r 2 ) = 


%JA In — 
2x€q r 2 


Q/L , a 
27reo <2 


at every point of the left-hand cylinder, and the negative of this 
at every point of the right-hand cylinder. Thus the cylinders are 
equipotentials, and can be replaced by conductors. By Gauss’s 
theorem, the total charge on each cylinder is +Q; for the surface 
integral of the normal component of E over the surface of the cylinder 
must be 1/eo times the total charge inside, which is ±Q for the line 
charges, and hence for the cylinders that produce equal fields at 
external points. Thus the difference of potential between the cylinders 
is [(Q/L)/ire o] In a/d, so that the capacity is 

^ _ X 60 -L 

— In a/d 

This can be put in terms of D , the distance between centers, rather 
than d. We have 

P—— = — = e ve ° L/c + e~ irt ° L/C — 2 cosh —fr'j 

a a a a d L 


or 


C = 


XCO-L 


cosh -1 D/2a 


( 2 . 1 ) 


3. Poisson’s Equation and Laplace’s Equation.—Suppose that 
within a volume V we have, not a discrete set of point charges, but a 
continuous distribution of volume charge, such as we should have for 
instance from an electronic space charge, if our scale of measurement 
is large enough compared with the interelectronic distance. We can 
then define a charge density p as the charge per unit volume (in 
coulombs per cubic meter in the mks system). Then Gauss’s theorem 
is replaced by 

f E • n da — — / p dv, OM) 

J s *o J v 

where the volume integral is over the volume V enclosed by the surface 
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S . This must hold for any arbitrary surface and volume. Now we 
may apply the divergence theorem of vector analysis, discussed in 
Appendix I. This states that, if F is any vector function of position, 
and div F, or v • F» is its divergence, then 



F • n da 



div F dv. 


(3.2) 


Applying to (3.1), we have 


1 


€0 




div E dv, 


where the equation holds for any arbitrary volume V. This cannot 
be the case unless the integrands are equal; that is, unless 


div E = —• (3.3) 

€0 

This is one of the fundamental equations of electromagnetic theory. 
We now combine with Eq. (3.3) of Chap. I, E = — grad <p, where <p is 
the potential. We then have 

VV = — —■ (3.4) 

This is Poisson’s equation. In a region where there is no volume 
charge, so that p = 0, Poisson’s equation reduces to 

VV = 0, (3.5) 

which is Laplace’s equation, and is one of the most important equa¬ 
tions in mathematical physics. It is as a result of its appearance in 
this equation that the operator v 2 is called the “Laplacian.” 

As an illustration of the variety of problems in which Laplace’s 
equation is encountered, we may mention static elasticity; elastic 
vibrations are governed by a wave equation, and when the time varia¬ 
tion in the wave equation is set equal to zero, Laplace’s equation 
results. Another illustration is the steady-state flow of heat, where 
the temperature satisfies Laplace’s equation. Diffusion, and the 
steady flow of electricity, are two other examples. Many more com¬ 
plicated equations, such as the wave equations of wave mechanics, 
contain the Laplacian operator, and are solved by analogy with 
Laplace’s equation, so that a knowledge of the solutions of Laplace’s 
equation proves to underlie a great deal of mathematical physics. 

‘ 4. Green’s Theorem, and the Solution of Poisson’s Equation in an 
Unbounded Region. —A partial differential equation, like Poisson’s 
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or Laplace’s equation, has a great variety of solutions. In the first 
place, Poisson’s equation, being a linear inhomogeneous differential 
equation (that is, containing the Laplacian, involving first powers of 
derivatives of <?, and containing the term — p/«o involving terms 
independent of <p), has a general solution that can be formed as follows: 
we find a particular solution, or particular integral, of the inhomo¬ 
geneous equation. We also find a general solution of the related 
homogeneous equation, formed by setting the term independent of 
ip equal to zero {that is, in this ease, of .Laplace’s equation (3.5)]. 
'phis homogeneous equation is called the “auxiliary equation,” and a 
general solution of it is called the “complementary function.” Then 
the sum of the particular integral, and of the complementary function, 
is in the first place a solution of the inhomogeneous equation, as we 
can see at once by substituting it in the equation. Furthermore, 
if the complementary function has sufficient, arbitrariness in it, we 
can use it to satisfy arbitrary boundary conditions. Thus the sum 
of complementary function and particular integral is a complete 
solution of the inhomogeneous equation. In the next chapter we shall 
take up some methods for solving Laplace’s equation, and satisfying 
arbitrary boundary conditions with it. We shall find that, to do this, 
we require an infinite number of arbitrary constants, or an arbitrary 
function, since we are dealing with a partial differential equation. 
In the present, chapter we shall demonstrate a simple method, (‘ailed 
“Green's method,” for finding a particular integral of (3.4). By 
combining tin* two, we shall then be in position to get a complete 
solution of Poisson’s equation. 

The first step in deriving Vlreen’s solution of Poisson’s equation 
is to prove Green's theorem, an important, theorem in vector analysis, 
discussed in Appendix I. We prove it from the divergency theorem, 
(3.2), bv setting F ** <p grad VL where f and yp are scalars. Then (3.2) 
becomes 

J ^ grad \p • n da :r J v (y?V 2 i/' -j- grad <p • grad yp) dv. (4.1) 

'phis is one form of Green’s theorem. To get the more familiar form, 
wo next writ** the same expression with <p and yp interchanged, and 
subtract, obtaining 


(f grad \p • n — yp grad f * n) da ■- J y (<pV 2x P ^ V) d v - (4-2) 

We can now use this theorem to obtain (ireen’s solution of Poisson s 
equation. Let us apply the theorem to the whole of space outside 
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a very* small sphere of radius R surrounding a point P. Let \p = 1/r, 
where r is the distance from P to the point x , y, z. Then vy = 0 
everywhere outside the sphere of radius R; for \p is the potential of 
a charge located at P, and hence satisfies Laplace's equation every¬ 
where except where the charge is located, or at P. The right side of 

(4.2) is then 



vy 

r 


dv , 


integrated over all space except the small sphere. For the left side, 
grad \f/ • n — — d(l/r)/dr — 1/r 2 . Thus the left side is 


f% da + J 7 

On the surface of the sphere, r = R, / (<p/r 2 ) da is then 



where ^ is the average value of over the sphere, or is approximately 
the value of <p at P. The second term is (1 / R ) ( d <p/dr) (4ttP 2 ) , which 
goes to zero as P approaches zero. Thus in the limit as R goes to 
zero, we have 


4:TT<p = 



(4.3) 


This is a mathematical theorem holding for any function <p, where the 
volume integral on the right side is extended over all space except 
an infinitesimal sphere surrounding the point P, where <p is the value 
of <p at P, and where r is the distance from P to x, y, z . Now let us 
combine this with Poisson's equation (3.4). Then, for the potential 
</? of the charge p, we have 


<p = 



(4.4) 


This is Green’s solution of Poisson’s equation. It is identical with 
Eq. (3.5) of Chap. I, which we have already obtained; it is simply 
a more elegant method of deriving that result. We shall find this 
more elegant method to be useful, in the next chapter, in finding 
the potential in a bounded region of space, rather than an unbounded 
region as we have here. 

6. Direct Solution of Poisson's Equation. —Green’s method forms 
one way of solving Poisson’s equation. However, sometimes p has a 
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sufficiently simple form so that, wo (‘an solve it directly, regarding 
it ns a problem in differential equations. Thus, for instance, suppose 
p is a function of one variable only, say x } independent, of y and z . 
Then the potential may also be chosen to be independent, of y and 
z. In this case Poisson's equation becomes an ordinary differential 
equation: 

dV pt.r) 

tiX' «u 


'phis can be solved as an ordinary differential equation for <p as a 
function of .*\ merely by integrating twice with respect, to x. As a 
very simple ease, let p be a constant. Then we have simply 




K,,r 


1 p .. 

r r* 
«) * 


( 6 . 1 ) 


Such solutions of Poisson's equation are useful in space-charge prob¬ 
lems, In studying space-charge limited emission from cathodes of 
various shapes, we use Poisson's equation, expressed in variables 
appropriate to the eatliode geometry, to find the potential from the 
charge distribution. This is combined with the dynamical equations 
of motion of the electrons, Newton's law giving the acceleration in 
terms of the force derived from tin* potential, and in this way we 
get a complete set of equations to determine both charge density and 
potential. 

We must remember, as has been mentioned curlier, that the 
solution, us found for instance in to. 1), is by no men,ns a general 
solution of Poisson's equation, even though it contains the. t wo arbi¬ 
trary constants, y and /•.’». For it is clear that, we can add to it. 
any solution of Laplace's equation, acting as the complementary 
function. It is true that our function y'n /*o.r is a solution oi 
Laplace's equation, but it is tar from a general solution, since if, is a 
function of x only. We shall examine general solutions of Laplaces 
equation in the next chapter, and shall show that a great variety oi 
such solutions exists. We shall show that, by using a. general solution 
of Laplace's equation, we can satisfy general boundary eondithms: 
wy <’an find a solution such that y* reduces to prescribed values over the 
boundary of the region in which we can satisfy Poisson s or Laplace h 
equation. 

Problem# 

1. < .iveii a spheiirnl distrihut ion of eharge, in whieh tin* density is n function 
of / Prove that the held at miv point is what would he obtained by imagining a 
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sphere drawn through the point, with its center at the origin, all the charge within 
the sphere concentrated at the center, and all the charge outside removed. Apply 
this result to the gravitational case, showing that the earth acts on bodies at its 
surface as if its mass were concentrated at the center. 

2. Given a sphere filled with charge of constant density. Prove that, at points 
within the sphere, the field is directly proportional to the distance from the center. 

3. Find the surface density induced by a charge on a plane conductor. Show 
by direct integration that the total induced surface charge equals the inducing 
charge in magnitude. 

4. For a certain spherical distribution of charge, the potential is given by 


-qe~ ar 


4:T€qT 


) 


where q , a are constants. 


Find a distribution of charge that will produce 


this potential, finding charge density as function of r, and the charge contained 
between r and r + dr. Consider whether a point charge at the origin is also 
required to produce the potential. The resulting charge distribution represents 
roughly the charge distribution within an atom. 

5. There are certain charges and conductors in an electrostatic field, whose 
potential is <p . Show that the surface density of charge on the surface of a con- 
ld<p 

ductor is —where n is the normal pointing out of the conductor. 

€ 0 dn r o 


6. Prove that the potential cannot have a minimum in an uncharged region of 
space. Prove therefore that a point charge cannot be in stable equilibrium under 
the action of electrostatic forces in an uncharged region. 

7. A certain vacuum tube contains a cylindrical cathode of radius n. Sur¬ 
rounding it is a space-charge sheath, of constant density —p (where p is positive), 
extending to a larger radius r 2 . The anode is a cylinder of still larger radius r h 
If the cathode is at potential <pi, the anode at potential find the potential as a 
function of r, both inside and outside the space charge. 

8. Discuss the charge distribution giving rise to equipotentials given by the 


real part of 


In [sin ir(x +jy)} - constant. 

^7T€o 


What is the form of these equi 


potentials for large values of yl Sketch the lines of force and the equipotentials. 
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A partial diflerential equation, such as Laplace's equation, has a 
great variety of solutions. The type of solution to be used depends 
largely on the type of boundary condition that must be satisfied. 
One method of solution is an extension of Green's method; we meet 
it later in this ehapfer. Another method is called “separation of 
variables." This involves finding solutions that are the product 
of a separate function of each of the three coordinates. It is a common 
method for solution of similar equations, and is in fact the moat 
powerful method available for the purpose. Different solutions can 
be found, depending on whether we work in rectangular, polar, or 
other coordinate systems. We start with rectangular coordinates, for 
a simple illustration of the method. 

1. Solution of Laplace’s Equation in Rectangular Coordinates by 
Separation of Variables. Working in rectangular coordinates .r, 7/, 2, 
let us try to find a solution of vV 0 in which is a product of a 
function .V of x, a function V of //, and a function Z of 1 (this is certainly 
not the most general solution, but we shall find that solutions of this 
type exist). 'Then Laplace's equation becomes 


t p V tpy 

i A * * 1 A A } ,, 

tls' dy x 


V V A 

A 5 . „ 0 . 

dz 3 


We divide by XYZ, obtaining 


I ti r .Y 1 tl*Y 1 d'Z 
X tLr 1 Y ihf 1 £ dz” • 

In this equation, the lirst term is a function of x only, the second a 
function of y only, the third a function of z only. It is clearly impos¬ 
sible that the sum of these should be zero independent of x, y t z , unless 
each term separately is a constant, and the constants add to zero. 
That is, we must have 



h\ 


1 d'Z 

z dz* 

20 




1 f/’.Y 

X il.r’’ 


! rf’Y 

Y < 111 ’ 


„■> 4 1,7 .4 r i „ 0. ( 1 . 1 ) 
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From these we then have three ordinary differential equations, 


(PX 


dx 2 

whose solutions are 


= a 2 X, 


,2 u * y 


dy> 


d*Z 
dz 2 


= c 2 Z, 


X = A x e ax + A 2 e~ aa: , 
F = B x e^ + J5 2 e-^, 
Z = C\e cz + C 2 e~ cs , 


where as a result of (1.1), some of the constants a, b, c must be real 
and some imaginary. Thus some of the functions X, F, Z vary 
exponentially with the arguments, the others sinusoidally. The prod¬ 
uct XYZ is now a particular solution of Laplace's equation. 

Any linear combination of such particular solutions is also a 
solution. Thus as a general solution we have 


<p = 2 (A X e ax + A 2 e~ ax )(B x e bv + B 2 e~ bv )(C x e cz + C 2 e~ cz ) 

where the summation is over an infinite number of terms, each with 
separate constants Ai, A 2 , B\, B 2 , C i, C 2 , a, b, c, subject only to the 
relation a 2 + b z + c 2 = 0. Thus the general solution has an infinite 
number of arbitrary constants. This is characteristic of partial differ¬ 
ential equations. The constants are to be chosen so as to fit the 
boundary conditions. We shall use rectangular coordinates if the 
boundary conditions are imposed over the surface of a rectangular 
region. For instance, suppose we have two planes, x — 0 and x = d, 
on which the potentials are given as functions of y, independent of 
z; we might have a composite electrode on each of these planes, 
different strips being maintained at different potentials by batteries. 
Suppose we wish to find the potential in the region between the 
planes. Clearly we can let <p be independent of z, so that c = 0, and 
a 2 = — 6 2 . Then we may write 

<p = 2(Aie ax + A 2 e~ ax ) (B x e iay + Bie~ iay ). 
where j = \/ — 1. When x — 0, this becomes 

2 (A i + A 2 ) (B ie iav -J- B , 
and when x — d it is 


2 (A ie ad + A 2 e~ ad ) {B x eJ ay + B 2 e~ iay ). 

Now a sum of an infinite number of terms of the form 2De iay is one 
way of writing a Fourier series, which we discuss in Appendix III. 
Thus at x — 0 the potential must be given by a Fourier series with 
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coefficients (A i + At)B x for the term in e inv , and at a: = d it is given 
by another Fourier series with coefficients (A ic ad + A^cr^Bx. By 
the theory of Fourier series we can find these coefficients, and hence 
the quantities A Jh, A»B X (and similarly A X B> 2 , A^B^). Thus we can 
get t he infinite set of arbitrary constants, from the boundary conditions. 

The method we have used above, for satisfying boundary condi¬ 
tions in Laplace’s equation, can he used also in discussing Poisson’s 
equation. Thus suppose we have the same problem as above, two 
parallel plant's on which the potentials are to be prescribed functions 
of ?/, but that between the planes we have certain charge distributions 
that arc assumed given. To keep the simplicity of our problem 
resulting from the independence of z, these charge distributions as 
well should be independent of z. We now know, from the two pre¬ 
ceding chapters, how to find a particular solution of Poisson’s equation 
for the potential resulting from these charge distributions. In par¬ 
ticular, for this problem, the charge can be considered as made up 
of uniform charges along lines parallel to the z axis, and we have 
already seen that such a charged line has a logarithmic potential, so 
that all we need to do is to sum such logarithmic potentials arising 
from all linear charges. The resulting potential, however, will not 
have t he correct, values along the two planes. We then sot up the 
complementary function, a solution of Laplace’s equation, built up 
as in the solution we have worked out in this section, and reducing to 
boundary values on the surfaces which, added to the potential already 
found from our particular solution of Poisson’s equation, will give the 
desired values. This is a problem of the type we have already con¬ 
sidered, so that we see that the determination of the complementary 
function for solving Poisson’s equation is a problem exactly analogous 
to solving Laplace's equation directly. 

2. Laplace’s Equation in Spherical Coordinates. The method of 
separation of variables, which we have applied in rectangular coor¬ 
dinates in the preceding section, can be applied as well in a number 
of other coordinate systems, notably in cylindrical and spherical 
coordinates, and as a less familiar case in ellipsoidal coordinates. 
The ease of spherical coordinates is the more important in practice, 
for it is used for the field of point, charges, dipoles, and other charge 
distributions concentrated near a point. In spherical polar coordi¬ 
nates r, 0, <p t Laplace’s equation for the* potential \J/ (we use this symbol, 
so as not. to confuse it. with the coordinate y?) is 

1 o / <V\ 1 0 / . 0 +\ , l OS n 

r 2 Or \ Or) r 2 sin 0 00 \ S,n oo) r 2 sin 2 0 0<p~ ’ 
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as we see in Appendix IV, in which we discuss vector operations in 
curvilinear coordinates. We assume that ^ is a product of three 
functions, one a function of r, one a function of 0, one a function of <p. 
That is, we assume — R(r)Q(6)3>(cp). Inserting this function, and 
dividing by the product RQ&, we have 


1 1 d / 2 dR\ 
R r 2 dr \ dr ) 


1 1 d 
"^0r 2 sin 0 dd 



1 1 d 2 3> 

4> r 2 sin 2 0 dtp 2 



If we multiply by r 2 sin 2 6, the first term and the second will depend 
on r and 0 only, the last on <p only. This is impossible unless each 
of these is a constant, and the constants add to zero. If the last term 
is — m 2 , we shall have 


JL I d / 2 d/A 

R r 2 dr \ dr ) 


d 2 4> 

dtp 2 


4 - = 0 


1 1 d_ 

0 r 2 sin 0 dd 



( 2 . 1 ) 


m 2 

r 2 sin 2 0 


= 0. 


( 2 . 2 ) 


If we multiply by r 2 , the first term will depend only on r, the second 
and third only on 6. Thus again each of these must be a constant. 
Let the first term be 1(1 + 1). Then 


1 d 
sin 6 dd 



1(1 + 1) 

~2 



1(1 + 1 ) 


m 2 

sin 2 0 _ 


0 = 0 . 


(2.3) 

(2.4) 


We have now separated the variables, in the sense that we have three 
ordinary differential equations, (2.3), (2.4), (2.1), for the functions 
R, 0, <&. 

3. Spherical Harmonics. —The solution of these equations is not 
difficult. For (2.3) we may try the assumption R = r 7 ‘. Then we 
find that we have a solution if n(n + 1) = 1(1 4" 1), whose solutions 
are n = l, or n = — (Z + 1). Thus 


R 


ar l 4" 


b 


where a, b are constants of integration. Equation (2.4) is Legendre’s 
equation. It can be solved by making the substitution 

0 = sin m d(A o 4- Ai cos 0 + A 2 cos 2 0 4- * • •)• 

We then find, on substituting in (2.4), that the following relations 
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A 0 [l(l + 1 ) — m(m 4* 1)] 4" 2.A2 = 0 
Ai[l(l + l)-(m + 1 )(m + 2)] + 2 • ZA 3 = 0 
A 2 [Z(Z + l)-(m + 2) (m + 3)] + 3 • 4A 4 = 0 


These allow us to solve for all the A’s in terms of Ao and Ax, which are 
constants of integration. In this way we find 


e = A 0 sin m d < 1 


[1(1 + 1) — m(m + 1)] 


cos 2 0 


[1(1 + 1) - m(m + l)]g(Z + 1) - (m + 2)(m + 3)] 


cos 4 0 4* 


( _ [Z(Z 4-1) — ( 7 ?! 4~ l)(m 4* 2)] , _ 

4- Ax sm m 0 cos 0 - ——-—---—-- cos 3 0 

, [Z(Z 4- 1) - (m 4- 1 )(m + 2)][Z(Z + l)-(m + 3)(m 4- 4)] 


cos 6 0 


* • - J* (3-1) 

Finally the solution of (2.1) is 

<3? = C sin m<p 4- D cos ra«p, (3.2) 

where C, D are constants of integration. 

We now find that m and Z must be chosen to be integers to satisfy 
certain conditions. Unless m is an integer, an increase of <p by 2ir, 
which brings us back to the same point of space, would lead to a 
different value of <$; thus m is an integer, provided that we are solving 
Laplace’s equation in a region that includes all values of <p. Next 
consider (3.1). If the series do not break off, they can be shown to 
diverge when cos 0 — 4 1, or along the axis of coordinates. To show 
that this is plausible, we find the ratio of the term in cos p 6 to that in 
cos p “ 2 0. This ratio is 


[1( 1 + 1) - (m + y - 2Km 4- P “ 1)1 

V(V - !) 


cos 2 6, 


which approaches — cos 2 & in the limit where p is very large. Thus 
for cos 6 — ± 1, this ratio approaches — 1, so that the terms of the 
series oscillate without decreasing in magnitude. Under these circum¬ 
stances it is possible, though not necessary, for a series to diverge. It 
is also possible for the series to converge, but a closer examination 
than we shall give shows in the present case that the series actually 
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diverges, and the series would become infinite for cos 0 = ± 1. If 
we are solving Laplace’s equation in a region including the axis, the 
potential certainly cannot become infinite along the axis. Thus the 
series cannot be allowed to diverge. This can be prevented only if l is 
an integer; for then one of the series (3-1) breaks off to form a poly¬ 
nomial, which is finite for all values of 0. The corresponding poly¬ 
nomials are called “associated Legendre functions,” and are commonly 
denoted by Pf(cos 0), provided that the constant factor multiplying 
them is properly chosen. These constant factors, and additional 
properties of the associated Legendre functions, are taken up in 
Appendix V. The product of sin m<p or cos m<p by P™(cos 0) is called 
a “spherical harmonic.” We then find as our general solution 




Pp(cos 0)(C sin m<p + D cos m<p). 


(3.3) 


Here, as with the rectangular case, we can add an infinite number of 
such terms to get a general solution, and the constants can be chosen 
so as to fit boundary conditions. For instance, we may solve the 
problem of the potential between two concentric spheres of radii r J} 
r 2 , where the potential is given by specified functions of 0, <p on each 
of the spheres. The method of satisfying these boundary conditions 
is taken up in Appendix V. 

4. Simple Solutions of Laplace’s Equation in Spherical Coordinates. 

We have derived a general solution (3.3) of Laplace’s equation. This 
solution is of great value in complicated problems that can be expressed 
in terms of spherical coordinates. However, in certain simple cases 
the solution reduces to very simple forms, which are of great impor¬ 
tance. Thus let us ask for a solution corresponding to l — 0, m — 0. 
In this case, by (3.1) and (3.2), we see that the functions of 0 and <p are 
constants, so that \f/ reduces to a + b/r. The first term is a constant 
of integration, the second the potential of a charge located at the 
origin. We can determine the constant b if we know the charge, by 
using Gauss’s theorem, or the fact that the potential of a charge q 
at the origin is q/Aireor, so that b = q!4c7T€o. 

The next simplest solution is for l — 1, m = 0. In this case the 
first sum of (3.1) fails to break off, and diverges, so that we must set 
its coefficient equal to zero in our solution; but the second breaks off 
after the first term, and is a constant times cos 0. Thus for this case 
we have 



^ar + 



cos 0. 


(4.1) 
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The firnt term, proportional to r cos 0, in dimply proportional to z, 
the coordinate along the axis of the spherical coordinates; thus its 
held is a constant along t he z axis, which is surely a solution of Laplace's 
equation. The next term, proportional to cos 0/r 2 , is particularly 
interesting: it is the potential of a dipole, which we shall consider in 
the next sertion. Before proceeding to it, we note that (4.1) can he 
used to solve an interesting problem: the potential of a grounded 
conducting sphere in a uniform external held. Let the external held 
he along the i axis, and let its magnitude he A’. Then at large dis- 
tances from the sphere we must have $ — AY ** — AY cos 6 , or wo 

must have a ■* — A’ in t LI). Next let the radius of the sphere be R t 
so that the potential must be zero on the sphere. We can arrange to 
have the potential of M.l) satisfy this condition, by making 

ah* f j* 0, b A’A >3 . 

'Thus (4.1 I reduces to 

/ /*’ ft ; ’\ 

+ [ '*> I JcosO, (4.2) 

which is a solution of Laplace’s equation, reduces to zero on the 
surface of the conducting sphere, and reduces to the correct, value 
at iniinite distance*. 



5. The Dipole and the Double Layer. A dipole by dehnition con¬ 
sists of two equal and opposite charges * // separated by a distanced. 
The potential at point f* is then 

1 \ 
r 1 i/ cos fl/ 

where r, II are as shown in Kig. S. If »l is small, this is approximately 




11 tl i’«ts a 

1 Tte., t' 
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where n is the unit vector along the axis of the dipole. The product 
qd is called the “dipole moment/’ and is denoted by m. Thus the 
potential of the dipole is 


, m cos 6 


(5.1) 


In speaking of dipoles, it is customary to assume that q is infinitely 
large, d infinitesimally small, in such a way that the dipole moment 
m is finite, and (5.1) forms the exact potential. By comparison with 
(4.1) we see that the second term of that expression is the potential 
of a dipole of moment 4 tt€o b, located at the origin. 

By analogy with this treatment of the dipole and its potential, one 
can set up solutions of Laplace’s equations similar to (4.1), but cor¬ 
responding to larger values of l than unity. One can then interpret 
these solutions as being the potentials of more complicated distribu¬ 
tions of charge at the origin. In this way one arrives at the definitions 
of quadripoles, and a whole series of higher multipoles, corresponding 
to successively larger values of l. We consider these higher multi¬ 
poles in Appendix VI. They are of particular importance in studying 
the external electric fields (and, as we shall see later, of magnetic fields) 
of atoms and molecules on the one hand, and of atomic nuclei and 
their elementary constituents on the other. We often are interested 
in their fields at a large distance compared with their dimensions, and 
then they act to a good approximation like multipoles. Thus, for 
instance, a diatomic molecule like HC1, one of whose atoms (H) tends 
to acquire a positive charge, the other (Cl) a negative charge, has a 
field at a distance that consists of a dipole as its leading term. On the 
other hand, a molecule like N* f which is symmetrical, has no dipole 
moment. We notice from (3.3) that the higher l is, the more rapidly 
the potential falls off with distance, varying as r~ (Z+1) . Thus, since 
.the higher multipoles correspond to higher values of l, their fields 
fall off more rapidly, and at large distances the field of the multipole 
of lowest l value predominates. For nuclei, the leading term in the 
field at large distances, aside from the electrostatic potential varying 
with 1/r resulting from the nuclear charge, is generally a magnetic 
rather than an electrical term. 

The reader might reasonably be surprised that in our present 
discussion we started by trying to solve Laplace’s equation, which 
is the special case of Poisson’s equation which we meet in a region in 
which there are no charges, and end up by finding the potential 
resulting from a point charge, dipole, or higher multipole. How, he 
might ask, did charges get into the theory? The answer is that our 
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spherical coordinate system introduces a singularity at the origin, and 
our final solutions, although they satisfy Laplace’s equation perfectly 
properly everywhere except at the origin, fail to satisfy it at the 
origin. The term in (3.3) in r l has no singularity at the origin, and 
if we know that there is no charge distribution of any sort at the origin, 
that term alone must be used in setting up a solution of Laplace’s 
equation. On the other hand, the terms in r l become infinite at 
infinite distance from the origin. Closer examination of them shows 
that they must be connected with the existence of infinite amounts of 
charge at infinite distance. Thus, to take a very simple case, suppose 
we have the potential r cos 0, corresponding to the first term of (4.1). 
This, as we have seen, is the potential of a uniform field, such as would 
be set up within an infinite parallel plate condenser. To set it up, we 
should then have to have equal and opposite charge distributions 
over two infinitely great condenser plates at infinite distance, and 
each would carry infinite charge. If, then, by some condition of a 
problem, we know that there is no charge at infinite distance, we must 
not use the terms in r l in the solution (3.3) of Laplace’s equation. Of 
course, if there is no charge at infinity, so that we do not use the terms 
in r l } and no charge at the origin, so that we do not use the terms in 
r —(i-H), an( j no charge anywhere between, as is implied by the fact that 
we are trying to solve Laplace’s equation, then there will be no field, 
the potential will be constant everywhere, and there is no problem to 
be solved. 

Returning to the discussion of dipoles, we have seen that (5.1) 
represents the potential of a dipole of moment m. Similarly the 
potential found in (4.2), for the problem of a grounded conducting 
sphere in a uniform external field, is the potential of the superposition 
of a uniform field, and of a dipole of moment 4we 0 R z E. We thus see 
that a conducting sphere of radius R acquires an induced dipole 
moment of this amount in an external field. When an object thus 
acquires a dipole moment, it is said to be "polarized,” and if the 
moment is proportional to the field, the ratio is defined as the polar¬ 
izability a. Thus we have 

m = cxE. (5.2) 

As we have just seen, the polarizability of a conducting sphere of radius 
R is 4tt€oR z . 

Often we have occasion to consider, not a single dipole, but a 
so-called "double layer,” a surface distribution of dipoles. Thus 
let two surfaces a distance d apart carry surface charges +<r per unit 
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area. Then we may say that the dipole moment per unit area is ad. 
We note that this double layer is equivalent to a parallel plate con¬ 
denser, and that the potential difference between the plates is ad/e 0 , or 
the dipole moment per unit area divided by eo. The field is zero out¬ 
side the double layer. We note that the normal component of E is 
discontinuous by amount a/e o at a charged surface (as seen in Sec. 1, 
Chap. II), and the potential is discontinuous by an amount (dipole 
moment per unit area) divided by « 0 at a double layer. 

6. Green’s Solution for a Bounded Region.—So far in this chapter 
we have been considering the direct solution of Laplace’s equation, 
by separation of variables, in rectangular or spherical coordinates. 
We now consider the other important solution, using Green’s method. 
We proceed as in Chap. II, Sec. 4, only we apply Green’s theorem to 
the volume V between the small sphere of radius R surrounding the 
point P, and a larger surface S'. Then, substituting ^ = 1/r in 
Green’s theorem, (4.2) of Chap. II, and proceeding as in the derivation 
of (4.3) of Chap. II, we have in addition a surface integral over S'. We 
then have in general 


4 ic<p = — 



vy 

r 


dv 



<p grad 


1 

r 


n-grad 

r 


<P * 


da. 


This, like (4.3) of Chap. II, is a general theorem holding for any function 
<p. If in particular vy = —p/e 0 , we have 


” / s ,fcG rad ^)-“-~ (grad v .”)] da. (6.1) 

This expresses <p at any point within V as the sum of two terms: first, 
the volume integral, representing the potential of the charges within 
V; secondly, the surface integral, which can be computed if the values 
of <p and its normal derivative are known at all points of the surface S'. 
Thus this theorem is in proper form to use in a case in which the 
behavior of the potential is known around the boundary of a region. 
Using the results of the preceding section, we can give a physical 
interpretation to the terms of the surface integral. Suppose we tried 
to set up such a distribution of surface-charge density, and such a 
double layer, on aS , that the field and potential within S' would have 
the values actually present in the problem, but so that the field and 
potential outside S' would be zero. The normal component of field 
on the inner side of S' would be — (grad <p) • n, so that the discon¬ 
tinuity in field at the surface is (grad <p ) • n. This discontinuity of 
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field would be produced a surface charge of density e 0 (grad <p) • n. 
The contribution of this charge to the potential would then be 

Je 0 (grad y) • n da 

Aire qT 

which is the second term of the surface integral in (6.1). Similarly 
the potential within S' is <p, so that the discontinuity is — <p, and the 
corresponding dipole moment per unit area to produce this discon¬ 
tinuity is — eo<p. The contribution of this to the potential is 

/i^„( grad 7)' nda ’ 

where the differentiation in the gradient is with respect to the coordi¬ 
nates of the point P. It is the negative of this if the differentiation is 
with respect to the coordinates x, y, z of the other end of the vector r, 
as in (6.1). The integral is then just the first term of the surface 
integral in (6.1). The distribution of surface charge and double 
layer over S' which we have just discussed is called “Green’s 
distribution.” We now see the significance of (6.1): the potential 
produced by the charge p within V, and Green’s distribution over S', 
is just the potential <p that we desire within V, but is zero everywhere 
outside. 

Problems 

1. It takes several volts of energy to remove an electron from the interior of a 
metal to the region outside. Find how many volts, if we represent the surface 
layer of the metal by a double layer consisting of two parallel sheets of charge; a 
sheet of negative electricity, of density as if there were electrons of charge 
1.60 X 10“ 19 coulomb, spread out uniformly with a density of one to a square 
4 X 10 -8 cm on a side; and inside that at a distance of 0.5 X 10~ 8 cm a similar 
sheet of positive charge. 

2. Find the components of electric field resulting from a dipole, both in rec¬ 
tangular and in spherical coordinates. 

3. A charge q is at a distance r from a polarizable dipole, whose moment is « 
times the field in which it is located, where a is the polarizability. Find the force 
of attraction between charge and dipole. 

4. Referring to Prob. 9, Chap. I, show that the force between a charge q and 
an uncharged conducting sphere can be found from the result of Prob. 3 above, 
taken together with the polarizability of a conducting sphere, provided that the 
distance between charge and sphere is large compared with the radius of the 
sphere. 

5. A dipole of fixed dipole moment is placed in an external electric field. 
Prove that there is a torque on the dipole proportional to its dipole moment and 
the magnitude of the electric field, and find how the torque depends on angle. 
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6. A dipole of fixed dipole moment is placed ii* an external electric field that 
is not constant. Prove that there is a force on the dipole depending on the rate of 
change of field with position, and find how this force depends on the orientation of 
the dipole and other features of the field. 

7 . Find the potential as a function of position in a region bounded by surfaces 
at x « 0, x - L, y = 0, extending to infinity along the y axis, subject to the 
boundary condition that the potential is zero along the two infinite surfaces x = 0, 
x = L, but that it is an arbitrary function of x along the surface from x - 0 to 
x = L, y = 0. Build up the solution out of individual solutions varying sinu¬ 
soidally with x, and exponentially with y , noting that they must decrease rather 
than increase exponentially as y increases. 

8 . Find the potential as a function of position within a sphere, if the surface of 
one hemisphere is kept at a potential +7, the other hemisphere at a potential - 7. 

9. Set up Laplace’s equation in cylindrical coordinates, and solve by separa¬ 
tion of variables. 

10 . A hollow pipe of circular cross section is infinitely long, and is grounded. 
A disk maintained at potential 7 practically closes the pipe at a certain point, but is 
insulated from it. Find the potential as a function of position within the pipe. 

11. Show that any solution of Laplace’s equation in two dimensions can be 

COS 

written as a linear combination of terms of the form r n '. n$, where n can be nnv 

sin ’ 

positive or negative integer. 

12. Given the two-dimensional potential on a circle of radius R about an origin 
of the form 7(0) (0 < 0 < 2 r), where 7(0) is continuous. Using Green’s theorem, 
show that the potential at any point ( r,<p ) inside the circle is given by 

V(r,if) = ^ f 2r _ V -MJL _ 

2t Jo jR 2 -f r 2 — 2ar cos (<p — 0) 

This is known as “Poisson’s integral.” What form would it take for points 
exterior to the circle? 

13 . Find the force and torque on a dipole in the field of a point charge. Find 
the force exerted by the dipole on the point charge, and verify Newton’s third law. 
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A dielectric, or insulator, is a material containing dipoles whose 
dipole moment is ordinarily proportional to the applied electric 
intensity. These dipoles can arise physically in two ways. In the 
first place, the molecules of any material are composed of positive 
nuclei, surrounded by rapidly moving negatively charged electrons. 
These electrons move very freely through the atom or molecule; they 
are prevented from escaping, however, by intense electric fields. 
When the molecule is placed in an external electric field, the electrons 
tend to be displaced by the field, much as the free electrons in a metal 
are displaced. They cannot travel outside the molecule, however; 
instead, they pile up on the surface of the molecule, resulting effec¬ 
tively in a surface charge, and the net result is that the molecule 
becomes polarized, or becomes a dipole. The situation is similar to 
that of the polarization of a conducting sphere in an external field, 
discussed in Chap. Ill, Sec. 5. We can even find a value for the 
polarizability of a molecule, using the formula a = 4 ire Q R z of that 
section, and assuming for R a molecular dimension, which is of the 
right order of magnitude. If there are N atoms or molecules in 
volume Vj so that the number per unit volume is N/V , and if each 
one acquires a moment aE, there will then be a dipole moment per 
unit volume of (N/V)dE. This moment is called the “polarization,’ 7 
and is denoted by P; it is a vector function of position, the polarization 
being in the same direction as E, in an isotropic dielectric. 

The other mechanism by which dipoles can be set up is found in 
certain materials containing polar molecules; that is, molecules that 
have dipole moments even in the absence of an external field. Thus a 
chemical substance like HC1, containing a positive and a negative 
ion, has dipole molecules, the H end of the molecule being positively, 
the Cl end negatively charged. In the absence of an external field, 
the molecules in gaseous or liquid HC1 will be oriented in random direc¬ 
tions, so that even though each molecule has a moment, the average 
moment per unit volume will be zero. An impressed field, however, 
tends to orient the molecules, and it can be shown that there is a net 
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dipole moment resulting from this, which is proportional to the field. 
Thus this effect, like the other, gives P proportional to E. There is a 
difference, however, which allows the effects to be separated. The 
induced dipoles are independent of temperature. The orientation of 
permanent dipoles, however, is opposed by temperature agitation, and 
by kinetic theory it can be shown that the resulting polarization in a 
given external field is inversely proportional to the absolute tempera¬ 
ture. By investigating the temperature variation of the dipole 
moment, then, we can experimentally separate the two effects, and 
can find values both for the polarizability of the individual molecules, 
and for their permanent dipole moments. For our present purposes, 
however, we need not distinguish between the two sources of 
dipoles, and need merely assume the existence of a polarization vector P, 
proportional to E, the constant of proportionality of course depending 
on the nature of the dielectric. 

1. The Polarization and the Displacement. —Let us consider a 
surface S bounding a volume V, within a dielectric. We start with the 
dielectric unpolarized, and then allow it to polarize. In this process, 
certain charges will have been carried across the surface S, so that, 
although originally the volume contains no net charge, there will be 
such a charge after polarization. Let us find this charge, by comput¬ 
ing the amount of charge flowing across an element da of the surface 
in the process of polarization. We may consider the polarization 
to consist of many dipoles, each of charge q, with displacement d 
(a vector, pointing along the direction of the dipole moment) between 
the equal and opposite charges. If n is the outer normal to V, the 
displacement of a charge q along n will be d • n. All those charges 
contained in the small volume of area da, height d • n, will be carried 
over da in the process of polarization. If there are N/V dipoles 
per unit volume, there will have been a charge of q(N/V)( d • n) da 
in this small volume, so that the total charge carried out over da 
will be ( N/V)(qd • n) da. But ( N/V)qd is simply the polarization P, 
the dipole moment per unit volume. Thus the charge carried out 
over da is P * n da, and the total charge carried out over the whole 
surface is the surface integral of this quantity. 

If p' is the resulting charge density within the volume, set up by 
the displacement of charge, the charge that has been carried out will 
be — /p' dv. Thus we have 

y^P-n da - f v P' dv - (hi) 

Using the divergence theorem, the left side can be rewritten / div P dv, 



Sec. 2] 


DIELECTRICS 


43 


since the relation (1.1) must hold for any volume, the integrands must 
then be equal, or we have 

div P = — p'. (1.2) 

We may now combine this equation with Eq. (3.3), Chap. II, which 
we may write eo div E = p, where p represents the volume density 
of charge. We must think carefully how to describe the situation, 
however. We have two types of charge density: the charge that we 
deliberately place on our conducting or dielectric bodies, and the 
charge that automatically appears as a result of polarization. We 
shall call the first sort the “real charge/ 7 the second sort the “polarized 
charge. 77 Both sorts can produce electric intensity E, so that, in 
Eq. (3.3), Chap. II, the charge density that appears should be the sum 
of real and polarized charge. 

It is customary, however, to use the symbol p to denote merely the 
real charge, so that for instance p will be zero within an or dinar y 
dielectric. Thus we must replace our equation by 

€ 0 div E = p + p', 

which means the same as Eq. (3.3), Chap. II, but is expressed differ¬ 
ently because of the different meaning we now assign to p. Using 
(1.2), we then have 

eo div E = p — div P, div (e 0 E + P) = p. 

The combination e 0 E + P comes into the theory so often and in such 
an important way that it is given a special name, the electric displace¬ 
ment, and a special symbol, D. Thus we write 

div D = P , D = e 0 E -1- P. (1.3) 

Equation (1.3) is one of the fundamental equations of electromagnetic 
theory, and is one of Maxwell’s equations. 

2. The Dielectric Constant.—Since P is proportional to E, the 
displacement D is also proportional to E. We define the ratio of D 
to E as the permittivity, and denote it by e. Thus we have 

D = € E. (2.1) 

The ratio of e to e () , which we may call the permittivity of free 
space, is the dielectric constant, sometimes called the “specific induc¬ 
tive capacity, 77 and is denoted by /c„. Thus we may write 

tie = — > D = 0 E. (2.2) 

CO V ' 



44 


ELECT ROM AGN ETISM 


[Chap. TV 


The ratio of polarization P to «<JE is called the “susceptibility,” and is 
denoted by xe - Thus we have 


P = X*«oE,. Ke = 1 4- Xe- 


(2.3) 


Clearly the susceptibility is proportional to the polai izability of the 
molecules, and the number of molecules per unit volume: in fact, since 
P = (JV/F)q:E, we have 

N “ (2.4) 


Xe T 7 


V 


We thus have means for finding the dielectric constant of a material, 
if we kn ow the polarizability of its molecules. There is one word 
of caution to be expressed, however. It turns out that the polar¬ 
izability of a molecule is affected by the presence of neighboring 
molecules. Thus in (2.4) we cannot use the value of a that would be 
obtained for a molecule in the absence of neighbors, and expect the 
resulting dielectric constant to be correct for a dense dielect ric. Wo 
shall take up this correction later. 

3. Boundary Conditions at the Surface of a Dielectric. — There are 
two fundamental equations of electrostatics: (1.3) and Kq. (3.4) of 
Chap. I, or 

div D = p, curl E = 0. (3.1) 

These equations take on special forms in the case commonly met in 
electrostatics, in which the dielectrics consist of materials each with a 
uniform dielectric constant, but with surfaces of discon¬ 
tinuity of dielectric constant between the material**. 
Within a homogeneous dielectric containing*no charges, 
we have div D = div eE = 0, so that div E = 0. From 
curl E = 0 we conclude as before that E = — grad 
and we can then find that V‘V = 0, so that Laplace’s 
equation is satisfied for the potential, as in the absence 
of a dielectric. 

At a surface of discontinuity, however, the situation 
is quite different. There e is a discontinuous function of 
position, so that we cannot infer from div «E - 0 that 
div E = 0; it will not be true. Let us instead investi¬ 
gate an uncharged surface of discontinuity between two 
dielectrics by Gauss's theorem, using arguments like those in ('hap. 11, 
ec. 1. Over unit area of the surface we erect a thin rectangular 
vo ume, as in Fig. 9. We note, from the divergence theorem and the 
equation div D = p, that Gauss’s theorem becomes 



Fig. 9.— 
Gauss’s theo¬ 
rem for nor¬ 
mal compo¬ 
nent of D. 
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/* D • n da = f y p dv . (3.2) 

Since there is no real charge within our volume, this tells us that 
the total flux of D out of the volume is zero; that is, that the flux in 
over one of the faces equals the flux out over the other, which demands 
that the normal component of D must be the same on both faces. In 
other words, we conclude that the normal component of D is continu¬ 
ous at an uncharged surface of separation between two homogeneous 
dielectrics. 

By an argument similar to that which we have just applied, we may 
apply Eq. (1.1), rather than (3.2), to the small volume of Fig. 9. 
Let us suppose that the dielectric is to the left of the sur¬ 
face of separation, empty space to the right. There will 
then be polarization to the left, and not to the right, so 

that j s P * n da will be the negative of the component 

of polarization along the normal pointing away from the 
dielectric. By (1.1), this equals the polarization charge 
included within the volume. This must clearly be a 
surface charge, and since unit area of the surface is in¬ 
cluded in the volume, we have the result that the normal 
component of P, pointing out of a dielectric, equals the 
surface polarization charge that appears on the surface 
as a result of the polarization. It is of course clear that, 
since the normal component of D is continuous at an 
uncharged surface, and the normal component of P is not continuous, 
the normal component of E must likewise be discontinuous. 

Next we consider the implication of the equation curl E = 0. We 
draw a contour as shown in Fig. 10, one long side on one side of the 
surface of separation, the other on the other. By Stokes's theorem 
in vector analysis, the surface integral of the normal component of the 
curl of a vector over a surface equals the line integral of the tangential 
component of the same vector around the contour bounding the sur¬ 
face. Applying this theorem to a surface bounded by the contour 
in Fig. 10, we note that, since curl E is zero, the line integral of the 
tangential component of E about the contour must be zero, which 
means that the tangential component of E must be equal on both 
long sides of the surface. We then have derived the following general 
results: 

From div D = p: normal component of D continuous 

From curl E = 0: tangential component of E continuous, (3.3) 



Fia. 10.- - 
Stolces’s the¬ 
orem for sur¬ 
face of dis¬ 
continuity. 




where these conditions hold at an uncharged surface of separation 
between dielectrics. 

4. Electrostatic Problems Involving Dielectrics, and the Con¬ 
denser— Without the principles derived in this chapter, it is almost 
impossible to see how to solve electrostatic problems involving dielec¬ 
trics as well as charges. The dielectrics acquire polarized charges on 
their surfaces, and if we do not know the amount and distribution of 
these charges, we cannot use our ordinary electrostatic principles 
to compute the field. The problem is not unlike that which we met 
in electrostatic problems involving conductors, where we also did not 
know th e distribution of surface charges. We solved that problem 
by obtaining a potential that was a solution of Laplace’s or Poisson’s 
equation in the region outside conductors, but satisfied certain bound¬ 
ary conditions at the surface of a conductor: the potential had to 
reduce to an appropriate constant on the surface of each conductor, 
which implied that the field E was normal to the surface, or that the 
tangential component of E was zero outside this surface. We see 
that this condition was a special case of (3.3): since E is zero within a 
conductor, the tangential component of E must be zero outside, by con¬ 
tinuity. Equation (3.3) gives no information about the normal com¬ 
ponent at the surface of a conductor, for the surface is not uncharged. 

We now notice that a method essentially similar to this one ran 


be used in solving an electrostatic problem involving dielectrics. 
Inside each dielectric, provided that its dielectric constant is uniform, 
the potential will satisfy Laplace’s or Poisson’s equation, where the 
charge density involved is the real charge, which we assume we know 


about. At each surface of discontinuity, the conditions (3.3) must bo 
satisfied by the appropriate components of D and E. Combining with 
the relation (2.1), D = eE, the problem is determined. As a first 
almost trivial example, consider the field of certain charges embedded 
m an infinite dielectric of permittivity € . By Gauss’s theorem, 
JE • n da - (l/«)/p dv. This differs from the corresponding equation 
for free space only in substituting e for « 0 . Thus Green’s solution of 
Poissons equation will carry through just as in Chap. 11, Sec. 1, with 
the one difference that e will appear in the denominator rather t han 

JL®? t t he pot °?* ia1 ’ aad the field E, of a set of charts in a uni¬ 
form dielectric, will be just 1 /k. as large as if the same charges were 

ZStJZST SPa -° e ' W ? nOti0e ’ howcver > that the displacement 
■ barges in a uniform dielectric will be the same as if they 

were in empty space. - 


As a next example of electrostatic problems 


involving dielect rics, 
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consider a condenser filled with a unif orm dielectric of permittivity e, 
rather than with empty space. We can use the same solution of 
Laplace’s equation to represent the potential that we did for the case 
without dielectrics; the only difference is in the relation between the 
charge on the plates and the potential difference. For a given voltage 
between the plates, there will be the same field E everywhere in the 
case with dielectric that there was without it. D will, however, be 
K e times as great. By Gauss’s theorem, the surface charge at a 
charged surface will equal D, so that the surface charge on the plates 
will be K e times as great as if the dielectric were absent. The capacity, 
being the charge divided by the voltage, will then be K e times as great 
as if the condenser had no dielectric but empty space. It is interesting 
to notice that, at the surface between the dielectric and the conducting 
plate, there will be not only the real surface charge D, but also be a 
polarization surface charge, equal to — P, or — (k 6 — l)eoE. The 
total charge, real and polarization, on the surface, will then be cqE, 
the same as the total charge for the condenser in the absence of 
dielectric. The reason a dielectric increases the capacity of a con¬ 
denser, or the charge that its plates carry for a given voltage, is that a 
fraction (k c — 1)/k 6 or 1 — 1 / k € of the charge is effectively canceled by 
polarization charge, leaving only the fraction 1 /K e for producing a field. 
Thus, to produce a given field, we must have k 6 times as much charge 
as in the absence of dielectric. 

5. A Charge outside a Semi-infinite Dielectric Slab.—In both the 
problems we have so far considered, the infinite dielectric and the 
dielectric filling a condenser, the effect of the dielectric is very simple: 
the field distribution is just as in empty space, but a given charge 
produces a field only 1 / k c as strong as in empty space. If this situa¬ 
tion were always true, the problem of dielectrics would be trivial, and 
we should not have had to go through all the theory that we 
have presented in this chapter. We shall now give two examples to 
show that in general the whole problem is entirely different in the 
presence of dielectrics from what it is without them. First let us take 
the problem of a charge q, in empty space, a distance d from an infinite 
plane surface bounding a semi-infinite dielectric of permittivity «. We 
can solve this problem by an application of the method of images, similar 
to that in Chap. I, Sec. 4, but more complicated. In Fig. 11 we shall 
try to satisfy our conditions by the following assumptions: in the free 
space, to the left of the surface of separation, we assume that the 
potential is q/AireoVi — q' /4ir€ 0 r 2 , where ri is the distance from the 
charge to the point where we are finding the potential, r 2 the distance 
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from the image to the point, and where the second term is the potential 
of an image charge — q f , whose magnitude we must still determine. 
In the dielectric to the right of the plane, we assume that the potential 
is q"/4xreri, where q" is also to be determined. These assumed poten¬ 
tials satisfy the first requirement: in the empty space the potential 
satisfies Poisson’s equation, being determined from the charge g; in 
the dielectric it is determined by Laplace’s equation, the charge pro¬ 
ducing the field being located out¬ 
side the dielectric. Clearly wo 
cannot have a term varying in¬ 
versely as r 2 , in the dielectric, for 
there is no real charge located at 
the image of q. 

We must now try to satisfy the 
boundary conditions on the sur¬ 
face. For the tangential compo¬ 
nent of E, we note that tho 
tangential derivative of potential 
must be the same on both sides of 
the surface, which means that the potential itself is continuous; this 
may be used as a substitute for the condition on the continuity of the 
tangential component of E. Thus, remembering that ri = r» at points 
of the surface, we have 


For the normal component of D, we note that the charges q and ~q r 
will produce fields that add rather than subtract. Remembering 
that we find D by taking the gradient of the potential and inultipl ving 
by the dielectric constant, this results in 



Fig. 11. —Images for point charge outside 
dielectric slab. 


arg 


Smce (5.1) and (5 2) can be solved for q' and q” in terms of q, our oomli- 

°Tr tlb r’ OU f assumed Potentials give a solution of the 
problem. Solving for q f and q" } we have 


^ _ Ke — 1 


CL" = 


2 K c 


-f 1 


< 2 - 


(5.3) 


Lines of force for this problem are shown in Fig. 12. 

+h + lu 1 ea f lly cileck two special cases of (5.3). First if K - 1 
so that the whole space is really empty, we have g' = 0 , ’g" = ~ £ 



Sec. 6] 


DIELECTRICS 


49 


that the field is just that of the charge q in empty space. Secondly, if 
k 6 is infinite, 5 ' = q, and q" — 2 q. The field E in the empty space 
in this case is that produced by the charge q, and an image —q; the 
field within the dielectric is zero, being a field of a finite charge in a 
medium of infinite dielectric constant. Thus the field in this case 
is everywhere the same that we should have for a charge q outside a 
perfect conductor. There are as a matter of fact many ways in which 
a medium of infinite dielectric con- 
stant resembles a conductor. 

6 . Dielectric Sphere in a Uni¬ 
form Field.—As another example of 
problems involving dielectrics, we 
consider a dielectric sphere in a 
uniform external field, in empty 
space. Following Eqs. (4.1) and 
( 4 . 2 ) of Chap. Ill, we try the as¬ 
sumption 


<p = -h 


cos 8 



outside the sphere, where Eq is the 
field at large distances, and where 
b is to be determined. This as¬ 
sumption correctly gives the field at 
large distances, and it satisfies La¬ 
place’s equation; if b can be chosen 
to satisfy the boundary conditions 
at the surface of the sphere, or 
radius R, we shall have a solution of our problem. Let us also try 
the assumption that the field is uniform, and equal to E 1 , within the 
sphere. That is, the potential within the sphere is —E\r cos 8. As 
in the preceding problem, we may take as our two boundary condi¬ 
tions the continuity of the potential, and of the normal component of 
D. For the potential to be continuous, we must have 


Fig. 12.—Lines of force for point charge 
outside dielectric slab. 


^ — EqR + -^ 2 ^ cos 8 = —EiR cos 8 , 


or 


E 0 R + — = -EiR. 

( 6 . 1 ) 


For the continuity of the normal component of D, we note that the 
direction of increasing r is the normal direction. Thus inside the 
sphere the normal component of D is —e\d<p/dr = e\Er cos 8, and 
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outside it is (e Q Eo + 2eo b/R*) cos 9. Equating, we have 

eiE 1 = *oE 0 + ( 6 . 2 ) 

Solving (6.1) and (6.2) simultaneously, we find 

Ei = 0 — E 0} 4ir€ob — Ak6qR z , 6 °) Eq. (0.3) 

Aeo ~r €i Aeo ei 

• 

Thus we have found the complete solution of our problem. The 
quantity 4 -jt€ 0 6 is the dipole moment of the dipole that would produce 
the same field at external points as the polarized dielectric sphere. It 
is interesting again to consider the two limiting cases ei == e 0 , and 
€i=oo. In the first case, we find Ei — Eq, b = 0, so that the field 
is uniform everywhere, as of course it must be if the dielectric sphere 
is not really present. In the second case, we find Ei = 0, 

47T€o?> = 4vT€oR 3 Eo, 

so that again the infinite dielectric constant gives the same fields as 
for a conductor, as discussed in Chap. Ill, Sec. 4. 

There are several remarks to be made about the solution we have 
found. In the first place, there are only very few problems in which 
we have the simple situation found here, that the field inside the 
dielectric object is uniform. This situation holds, in fact, only for an 
object of ellipsoidal shape. The general ellipsoid can be solved exactly, 
as well as the sphere, but by considerably more advanced methods 
than that used here. In the second place, our solution holds equally 
well for the field within an empty spherical hole in the dielectric of 
permittivity ei. As we see from the derivation, to get this result 
we need only interchange e 0 and ei in the solution (6.3). It is interest¬ 
ing to consider the field Ex within the cavity. The ratio E x /E n can 
be written 


El = i _|_ t 1 . ~ € ° = L 1 i __ 2(ci — € 0 ) 
E o 2ei eo €o 2ci -f- eo 


(6-4) 


Thus the field inside the cavity is greater than in the dielectric [wince 
(ei — € 0 )/(2ei + € 0 )' is greater than zero], but is less than ei/eo* times 
the field in the dielectric. 

7. Field in Flat and Needle-shaped Cavities.—In the preceding 
section we have found the field within a spherical cavity in a dielectric. 
It is interesting also to find the field in a disk-shaped cavity whose 
normal points along the field, and in a needle-shaped cavity pointing 
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along the field. In the flat cavity, by symmetry, the field inside as 
■well as outside the cavity will be parallel to the field at a large distance, 
so that, by the continuity of the normal component of D, the value of 
3D within the cavity will be equal to that in the dielectric. That is, 
the value of E within the cavity will be €i/€o times the value of E in the 
dielectric, or will be k b times the value of E in the dielectric. On the 
other hand, in a needle-shaped cavity, because of the symmetry, 
the field will again be everywhere parallel, and because of the con¬ 
tinuity of the tangential component of E, the value of E within the 
cavity will equal that in the dielectric. 

These facts are sometimes used to give definitions of E and D in a 
dielectric, in a form that could be made the basis of an experimental 
method of measuring them: E is the force on unit charge in a long 
needle-shaped cavity parallel to the field, and D is «o times the force 
on unit charge in a flat disk-shaped, cavity with its normal parallel 
to the field. These definitions are analogous to these introduced in a 
similar w r ay by Lord Kelvin for defining the corresponding magnetic 
vectors. It is interesting to note that these two cavities are the 
limiting forms of ellipsoidal cavities as they get very flat or very 
elongated, and the solution for the field in an ellipsoidal cavity, which 
we have mentioned in the preceding section, reduces to these two 
values in the two limits. In the intermediate case of the sphere, we 
have seen in the preceding section that the field in the cavity is inter¬ 
mediate between these two limiting values. 

Problems 

1. A line charge of linear density <r is placed in a medium of dielectric constant 
ki = «i/€o parallel to and at a distance a from the plane boundary with another 
medium of dielectric constant k 2 = ta/e<i. Find the. potential in both media, and 
show that the force per unit length acting on the line charge is given by 

<r 2 3 4 K\ — Ks 
4.-7T€ \(l K 1 -f- K2 

2. A long circular cylinder of radius a and permittivity e is placed with its axis 
perpendicular to a uniform electric field Eu in air. Find the potential inside and 
outside the cylinder. 

3. Find the capacity of a spherical condenser consisting of a conducting sphere 
of radius r a , a dielectric of dielectric constant m for n> < r < r i, a dielectric of 
dielectric constant /< a for r i < r < ?••>, and a conducting sphere of radius r 2 . 

4. Find the capacity per unit length of a cylindrical condenser consisting of a 
conducting cylinder of radius r<>, a dielectric of dielectric constant k\ for r 0 < r < n, 
a dielectric of dielectric constant k 2 for ri < r < r 2 , and a conducting cylinder of 
radius r 2 . 





CHAPTER V 


MAGNETIC FIELDS OF CURRENTS 

In Chap. I, we stated that the force on a charge q, moving with a 
velocity v in a magnetic field, in which the magnetic induction is B, 
is q(v X B). Similarly we stated that the force on a charge q in an 
electric field E was qE. So far we have been studying the electro¬ 
static case. We began by stating the value found experimentally 
for the electric field E produced by a charge q. Combination of that 
statement with the law of force allowed us to get the force between 
two charges. We shall now begin a similar treatment of magnetic 
forces. Magnetic fields can be produced, and magnetic forces experi¬ 
enced, by two types of bodies: by charges in motion, or electric cur¬ 
rents, and by magnetized bodies, such as permanent magnets. In 
some cases, the magnetization of magnetic bodies arises from the 
motion of currents within the atoms, coming from electronic motions, 
but in other cases it comes from magnetization that is an intrinsic 
part of the structure of the nuclei or electrons of which the atoms are 
composed. We could start our study from the fields either of magnets 
or of currents. We choose to start with currents, as being in a way 
more fundamental. We shall then show that a current flowing in 
a small closed path is equivalent to a magnetic dipole, and forms a 
model for the dipoles present in magnetic materials, which we shall 
take up in the next chapter. 

The magnetic field produced by a steady current, which does not 
vary with time, is much simpler than that of a current that varies 
with time, for in the latter case we can have radiation. The problem 
of magnetic fields of steady currents is called “magnetostatics.” It 
is clear that a single charge in motion cannot produce a static mag¬ 
netic field, for by its very motion it is found at different points of 
space at different times. Thus we cannot strictly state the field 
produced by a charge in motion. Nevertheless, if the charge forms 
part of a steady current, as if there were a procession of charges, 
following one after the other in a formation independent of time, then 
there is a simple law for the field it produces. This law, elucidated 
by the work of Oersted, of Ampere, and of Biot and Savart, in the 
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early days of the nineteenth century, may be made the basis of the 
treatment of magnetostatics, just as the law of Coulomb may be made 
the basis of electrostatics. 

1. The Biot-Savart Law. —It is found experimentally that the 
magnetic induction resulting from a charge q, moving with a velocity 
Vj at a distance r away from the charge (where r is a vector pointing 
from the charge to the point where the field is being found) is 


■d _ mo q(v X r) 
B “ 4x |r| 3 ’ 


( 1 . 1 ) 


provided that the moving charge forms part of a current distribution 
independent of time. In this expression, B is the magnetic induction 
in webers per square meter. Since this is not a familiar unit, we can 
state that 1 weber/sq m equals a magnetic field of 10 4 gausses, in the 
more usual units. The quantity po is 


ju 0 = 4-7T X 10~ 7 henry/m, (1.2) 

where we shall find the physical significance of this number later. 
The charge q is in coulombs, the velocity in meters per second (a 
vector), and r is in meters. By virtue of the vector product in (1.1), 
we see that the lines of B (that is, lines parallel to the vector B) form 
circles in a plane normal to the velocity vector v, with centers at the 
intersection of that vector and the plane, and in such a sense that a 
right-hand screw advancing along v would rotate in the direction of B. 
If 6 is the angle between v and r, the magnitude of B is equal to 

ITDI _ Mo g|v| sin e 
11 4t r |r| 2 * 

We are often interested not in the field of a moving charge but in 
that of an element of current, as a length ds of wire carrying a current i. 
We can easily set up an equivalence between the quantity qv and a 
corresponding quantity for the current element. Suppose the cross 
section of the wire is A, and the charge density of charge in the wire 
is p. Assume this charge is moving with an average velocity v. Then 
the charge crossing any cross section per second is pvA. On the other 
hand, if the current flowing is i, measured in amperes, then i must 
be the charge crossing a cross section per second. Thus we have 
pvA = i. If we multiply both sides by ds, the length of the section 
of wire, we shall have on the left pA ds v, which is the product of the 
charge density, the volume, and the velocity. That is, it is the total 
charge contained in the element of wire, times its velocity. This then 
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equals i ds. We may make this into a vector equation by letting v, 
and ds, both be vectors along the velocity of the charge, or the direc¬ 
tion of the wire. Thus we have 

qv = i ds, 

so that the magnetic induction resulting from a length ds of wire 
carrying a current i is 

<2B = y . I )■ (1.3) 

4tt (r| 3 v ' 

The law described by (1.1) or (1.3) is often called the “Biot-Savart 
law.” As a result of it, and Eq. (1.1) of Chap. I, for the force acting 
on a moving charge, we can at once find the force between two ele¬ 
ments of current, fidsi and i 2 ds 2 , flowing in different conductors. 
This force is 

,-p _ Mo • . [dsi X (ds 2 X r)] . . 

d¥ = — hi* -j^j-3-> (1-4) 

where we have found the force exerted by the second current element 
on the first, and where r is the vector from the second to the first. 
This law (1.4) is the equivalent, for current elements, of Coulomb’s 
law, Eq. (2.2) of Chap. I, expressing the law of force between static 
charges. 

The law of force (1.4) serves essentially to define the ampere, the 
unit of current. Thus we take two current elements, each of unit 
length, and at unit distance apart, so oriented that the factor 

[dsi X (ds 2 X r)] 

M’ 

in (1.4) equals unity. Then the currents are defined to be an ampere 
if the resulting force is 10“ 7 newtons. From this it follows that 
jno = 4ir X 10~ 7 , as in (1.2), a relation that is now seen to be the result 
of definition rather than of experiment. This on the other hand is not 
the case with e () , which we encountered in Eq. (2.1), Chap. I, and which 
we found from Coulomb’s law. The difference is that we define the 
unit of current, the ampere, from the Biot-Savart law; we define 
the coulomb, the unit of charge, as the charge flowing per second 
in a current of 1 amp; and therefore we are not free to choose the 
constant in Coulomb’s law at will, but must determine it by experi¬ 
ment, with tin 1 ; result given in Eq. (2.1), Chap. I. These relations are 
discussed in more detail in Appendix II. 

2. The Magnetic Field of a Linear and a Circular Current* —We 
can integrate the Biot-Savart law (1.3) to get the magnetic induction B 
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resulting from any arbitrary steady current. Naturally in practice 
we must use such an integrated value to determine the ampere, since 
the interaction between two elements of current, as considered in (1.4) 
or in the preceding paragraph, cannot be separated from the forces 

resulting from other current elements in the circuit. 
First we consider the field of an infinite straight wire 
carrying a current i. From Fig. 13 we see that 
r 2 = R 2 + s 2 , where R is the distance from the wire to 
the point where we are finding the field, and that 
sin 0 = R/\/R 2 + s 2 . Thus we have 



B 


= /_ 


ds 


2wR 


( 2 . 1 ) 


Fig. 13.— 
Magnetic field 
from an Infinite 
straight wire. 


c R 2 + s 2 )K 

The direction of B is, of course, in circles surrounding 
the wire. 

Next we consider the field of a circular wire carry¬ 
ing a current, at points along the axis of the wire. The 
contribution to the field produced by an element 
ds of the wire, at a point whose distance from the plane is x, will be 
(mo/ 4nr)z ds /( x 2 + R 2 ), as we see in Fig. 14. This field points at an angle 
to the perpendicular to the plane, however, and only the component 
along that direction contrib utes to th e resultant field. Thus we must 
multiply by the factor R/sjx 2 + R 2 to get 
the resultant field, which is, after we replace 
the element ds by the circumference 2ttR, 


B = 


Mo 


2ttR‘ 


_ „• 


R 2 



Air (x 2 + R 2 )& 2 " (x 2 + R 2 )* 

( 2 . 2 ) 

We can use the solution (2.2) to find the 
field in the center of a circular current; it is 
noi/2R. This solution is sometimes used 
to provide an experimental determination 
of the ampere, following the procedure 
discussed at the end of the preceding sec¬ 
tion. It is quite a complicated problem to find the field off the axis in 
the case of a circular wire. 

The solution (2.2) can be used to find the field along the axis of an 
infinite solenoid, carrying n turns of wire per unit length. To find 
this we need merely multiply (2.2) by n dx, and integrate over x, 
taking account of the turns of the solenoid located at all distances x 


Fig. 14.—Magnetic field from 
circular current. 
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from a given point on the axis. Thus we have 

* #2 

. (*» + B»)H = ^ ni ; ( 2 - 3 > 

We shall shortly prove that this field is found everywhere inside the 
infinite solenoid, not merely along the axis. 

3. The Divergence of B, and the Scalar Potential. —In our study 
of electrostatics, we first found simple methods of obtaining the 
field by integrating the contributions resulting from all point 
charges according to Coulomb’s law. We soon found, however, that 
this was a method of restricted usefulness. The really powerful 
tools of electrostatics come only when we apply general analytical 
methods, introducing the potential, Gauss’s theorem, Poisson’s equa¬ 
tion, and similar relations. Here too we shall introduce general 
analytical methods. As a first step, we investigate the divergence and 
the curl of B. We shall first show that div B is always zero; then we 
shall show that curl B = 0 in important cases, in which we can there¬ 
fore introduce a potential, but that this is not true in general, so that 
we have to adopt other ways of writing B in terms of a potential, lead¬ 
ing to the concept of a vector potential. 

We first consider the divergence of B. The lines of B form closed 
circles around the axis of a current element, and since the divergence 
of a vector signifies the starting or stopping of the vector, we should 
expect that div B = 0. We can prove this, directly from (1.1) or 
(1.3), and if the theorem holds for the field of an element of current, it 
must hold for the sum or integral of such fields, or for the field of a 
complete circuit. We can prove our result by use of the vector 
relation div (a X b) = b • curl a — a • curl b, where a and b are any 
vectors. We let a = v, b = r/|r| 3 , in (1.1). Since v, or a, is a 
constant, the first term will be zero. Then we have 

divB = curl (3 - x) 

But the vector r/|r| 3 is simply a vector in the direction of r, of magni¬ 
tude 1/r 2 ; that is, it is just like the field of a point charge in electro¬ 
statics, whose curl has been proved to be zero in earlier chapters. 
Thus, from (3.1), we have 

div B = 0. (3.2) 

Equation (3.2) is one of the fundamental equations of electromagnetic 
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theory, one of Maxwell/s equations, which we shall not have to alter 
in our later development. 

Next we consider the curl of B. If current flows steadily in a 
closed loop of wire, we can show that the curl of the resulting mag¬ 
netic induction is zero everywhere outside the wire, and hence that 
we can introduce a potential to describe the problem. We shall prove 
this theorem by setting up a simple and general expression for the 
potential. As a first step, we consider the solid angle 12 intercepted 

by the loop of wire, at the point P 
where we are finding the magnetic 
induction B. If we make a dis¬ 
placement du of Pj as shown in 
Fig. 15, the solid angle will change 
by an amount do. This is the 
same as the change dQ produced 
by a displacement — du of the 
loop. If we make the latter dis¬ 
placement, the change of solid 
angle will be the sum of all the 
elementary changes of solid angle 
intercepted by the small parallelo¬ 
grams bounded by the vectors du, 
ds, as shown in the figure. The 
projection of one of these paral¬ 
lelograms on the normal to the 
vector r is •— (du X ds) • r/|r|, so that the corresponding increment of 
solid angle is — (du X ds) • r/|r| 3 = — du • (ds X r)/|r| 3 . 

Thus the whole change of solid angle, integrating around the cur¬ 
rent loop, is 


du 



.Fig. 15.—Scalar potential of loop of wire 
carrying current. 



ds X r 


But at the same time the change di2 must be grad 12 * du. 
have 


- grad Q = / 


Thus we 


Comparing with (1.3), we see that we have 


B = — ^ i grad - (3.3) 

In other words, the function (/x 0 /4ir)iS2 forms a potential, whose 


Sec. 5] 


MAGNETIC FIELDS OF CURRENTS 


59 


negative gradient gives the magnetic induction B. This is generally 
called a “scalar potential,” since it is a scalar quantity, to distinguish 
it from the vector potential, which we shall soon introduce. It is 
clear that, in cases in which B can be derived from a scalar potential, 
we must have curl B = 0, since the curl of any gradient is zero. 

4. The Magnetic Dipole. —Suppose we have a current i flowing in a 
positive direction around a very small loop of area A, the normal 
to the loop being n. If 9 is the angle between n and the vector from 
the loop to a point P, the solid angle subtended by A at P will be 
A cos 0/r 2 , so that the scalar po¬ 
tential will be 

fioiA cos 9 
Air r 2 



rm • • j j • Fig. 16 .—Surface made up of small current 

This expression depends on posi- loops. 

tion just like the potential (5.1) of 

Chap. Ill for an electric dipole. Thus a small loop carrying a current 
has a magnetic induction B, whose lines of force are like those of an 
electric dipole. We call such a small loop a “magnetic dipole,” and 
define the product iA as the dipole moment. (Some writers define 
the dipole moment as wiA instead, with corresponding change in 
subsequent formulas involving dipole moments.) If this moment is 
called m, we have 

Potential of magnetic dipole = (4.1) 


In terms of this formulation, there is an interesting way in which 
we can regard the magnetic field of a current loop as coming from a 
distribution of magnetic dipoles over a surface spanning the loop. 
In Fig. 16, we may divide the surface into many small loops, letting 
a current % flow in each of them, in a positive direction. These cur¬ 
rents will cancel each other on all the interior boundaries of the 
loops, but not over the outer boundary. Thus the sum of the cur¬ 
rents of all the small loops will give just the current originally present 
in the large loop, so that the magnetic field of the small loops must 
equal that of the large loop. Each of the small loops, of area A, 
however, produces a field like a dipole of moment iA. In other 
words, a distribution of dipoles over the surface, or a double layer, 
whose dipole moment per unit area is i, distributed over any surface 
spanning the current loop, will have the same magnetic induction at 
all points that the current loop itself has. 

5. Ampere’s Law. —Although B is derivable from a potential func¬ 
tion, we must not assume without further study that the line integral 
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of B about a closed contour is always zero, which we should be inclined 
to deduce by analogy with Sec. 3, Chap. I. Let us compute the line 
integral of the tangential component of B . about a curve as shown in 
Fig. 17, enclosing the current. Using (3.3), we have 


JB • ds 




Fio. 17.- 


-Path of integration around 
current loop. 


If we make a small excursion, and return to our starting point, £2 
comes back to its initial value, J d£2 is zero, and the line integral 

JB • ds is zero. If, however, as 
in Fig. 17, our path encloses the 
current, the situation is quite 
different. If we start integrating 
when we are in the plane of the 
loop, £2 will Jbe 2x, corresponding to 
a hemisphere. As we traverse the 
path in the direction shown, £2 will 
decrease from this value to zero, become negative, and when we 
return to our original point it will be — 2r. This is on the assumption 
that solid angles are positive when the point P is above a plane in 
which the current is circulating in a positive direction, as in Fig. 15. 
The net change in £2 is then —4x, so that JB • ds = /x 0 z. In other 
words, when we integrate the tangential component of B about a 
contour enclosing the current i, the integral is /jlqi; whereas when we 
integrate about a contour enclosing no current, the integral is zero. 
The statement of these facts is Ampere’s law: the line integral of B 
about a closed contour, in a region containing steady currents, equals 
Mo times the total current flowing through the contour of integration. 
This general case obviously follows from the derivation we have given, 
if we regard the whole current flow as being made up of many loops, 
some of which thread through our contour of integration, some of 
which do not. 


Ampere’s law can often be used, as Gauss’s theorem was used in 
electrostatics, to get the answers to simple problems in the magnetic 
fields of currents. For instance, we may solve for the field of a linear 
current, taken up in Sec. 2. If we apply the theorem to a circular 
contour surrounding the current, B is parallel to ds by symmetry, and 
is constant around a circular contour. Thus B(2ttR) = p, Q i, 
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as found by direct integration in (2.1). Similarly we can use the 
theorem to find the field inside a solenoid. Setting up a contour as 
shown in Fig. 18, we may assume that the field points along the axis 
by symmetry, and that it is zero outside the solenoid. Then, if the 
length of the contour is unity, the line integral /B • ds is simply the 
value of B inside the solenoid. The current enclosed by the contour 
is nij if there are n turns in unit length, each carrying current i. 
Thus we have B — noni, as in (2.3). Our present answer 
is more general, however, for it holds for any point with¬ 
in the solenoid, not merely for points on the axis. 

Ampere's law, which may be stated in the integral 
form as 

JB • ds = juoSi, (5-1) 

Fig. 18.— 

where 2 i indicates the total current threading through .Ampere’s law 
the contour, may be written in a differential form by for a solenoid, 
using Stokes's theorem. This theorem of vector analysis states that 

JA • ds = / curl A • n da, 

where A is any vector function of position. That is, the line integral 
of the tangential component of A about a closed contour equals the 
surface integral of the normal component of curl A over any surface 
spanning the contour. Using this theorem, we can transform the 
left side of (5.1). To transform the right side, we introduce the 
current-density vector J. This represents the number of amperes per 
square meter flowing in a continuous conductor. The current flowing 
across a surface element da is then J • n da, so that the total current 
flowing through the contour is /J • n da integrated over a surface 
spanning the contour. Thus we have as a result of (5.1) 

f curl B • xi da = mo/J • n da. 

This result must hold for any contour and any surface spanning it; 
thus the integrands must be equal, and since the integrands are equal 
for any direction of the vector n, the vectors themselves must be 
equal, or 

curl B = noj- (5.2) 

This is the differential formulation of Ampere's law. 

We see from (5.2) that, in a region containing no current flow, 
curl B = 0, but this does not hold within a conductor. In empty 
space where curl B = 0, we can introduce a potential, as we have done 
in (3.3), but this cannot be done inside a conductor carrying a current. 
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The potential we introduced in Sec. 3 for a single loop of current was 
multiple-valued; that is, as we integrated B around a contour enclosing 
the current the potential increased by noi, so that if we integrated n 
times around the contour we should increase the potential by nn 0 i. In 
other words, for any single value of the potential, all values obtained by 
adding or subtracting integral multiples of noi are equally legitimate. 
If now we have many current loops in our space, we can change the 
potential by integral multiples of times any one of the currents, by 
traversing suitable contours. With enough current loops, this means 
that we can obtain almost any value of the potential we desire, at a 
given point of space; and if the current is distributed over the volume, 
so that we can enclose any amount by traversing a suitable contour 
within the volume, we can obtain any value of potential whatever 
at a given point of space. In this case the usefulness of the idea of 
potential breaks down completely. The integral of B is not independ¬ 
ent of path, and there is no unique way of defining a scalar potential. 

6. The Vector Potential.—It is clear from our discussion of the 
preceding section that a scalar potential is not useful for discussing 
a magnetic field, except in the special case where the current flows 
in a single loop. Mathematically, we have seen in (5.2) that curl B 
is not in general zero, so that the conditions for introducing a potential 
do not exist. There is, however, another quite different way of setting 
up a potential function, which is much more general, and equally 
useful. This is to set up what is called a “vector potential.” The 
fundamental characteristic of a potential is that it is a function that 
we differentiate to get the vector field we are interested in. The reason 
why the potential is a useful device in electrostatics is that it is easy 
to compute it from the charge distribution, by the solution of Poisson’s 
equation given in Eq. (3.5) of Chap. I. 

The vector-potential solution for the magnetic field of currents has 
the same useful features. The existence of a vector potential is 
based on Eq. (3.2), div B = 0, satisfied by the magnetic induction. 
By a well-known theorem of vector analysis, div curl A = 0, where A 
is any arbitrary vector; that is, the divergence of any curl is zero. It 
seems reasonable to assume from this that we can write 

B = curl A, (6.1) 

where A is a vector function of position, which is the vector potential 
we have spoken of. By making the assumption (6.1), (3.2) is auto¬ 
matically satisfied, and we must choose A so as to satisfy (5.2) as 
well. This gives us 


curl curl A = /*oJ. 
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By a theorem of vector analysis, curl curl A == grad div A — v 2 A, 
where A is an arbitrary vector. We shall now assume that 

div A = 0. (6.2) 

We are allowed to make this assumption; it turns out to be the case 
that, to determine a vector function uniquely, we must specify both 
its divergence and its curl at all points of space, and (6.1) leaves the 
divergence undetermined. Making the assumption (6.2), we then 
have 

V 2 A = —moJ. (6.3) 

That is, A satisfies Poisson’s equation, as <p was shown to in Eq. (3.4), 
Chap. II. Solving by Eq. (4.3), Chap. II, for an unbounded region, 
we have the general solution 



A solution similar to Eq. (6.1) of Chap. Ill, for a bounded region, 
can be set up as well. In (6.4) we have a general solution for the 
vector potential, and hence for the magnetic induction, of any known 
distribution of currents. 

We may show from (6.4) that we are in fact led to the same solu¬ 
tion from the Biot-Savart law which we have already discussed, just 
as the general solution of Poisson’s equation led in the electrostatic 
case to the same result as an elementary discussion of Coulomb’s law. 
In the first place, suppose the current density J is flowing in a length 
ds of a conductor of area a. Then Ja = i, the current flowing in the 
conductor, and J dv = Ja ds — i ds. Thus the vector potential aris¬ 
ing from a current element i ds, where we regard ds as a vector, is 

dA = iA S , ((5.5) 

4tt v K ' 


a vector in the direction of the current element. To find the value 
of B arising from this vector potential, we take its curl. Noting 
that ds is a constant vector, r a scalar function of position, we need 
to use the formula curl (/F) = / curl F -f- grad / X F, where / is a 
scalar, F a vector function. In our case F is constant, so that we have 

dB — i grad X ds. 

But grad (1/r) = —1/r 2 grad r, and grad r = r/|r|. Thus finally 


dB = 


Mo .ds X r 
4t * ” [if 
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in agreement with (1.3). We have, then, given a proof of the Biot- 
Savart law from Eqs. (3.2) and (5.2), and have illustrated our method 
of handling the vector potential, but have not arrived at a new way 
of** solving the magnetic problem. The solution for a bounded 
region, simil ar to Eq. (6.1) of Chap. Ill, would be new, but it is not of 
enough practical importance for us to work it out. Although the 
vector potential has given us no new information in this case, we shall 
find that in problems involving time variation of charge and current 
we can still use it, and it then proves a very valuable method of solving 
for the magnetic induction. 

Problems 

1. A steady current flows in a circular loop of wire, of radius R. Find the 
vector potential of the resulting magnetic induction, at large distances compared 
with R, by adding the contributions to the vector potential due to the separate 
elements of current. 

2. Compute the field, from the potential of the preceding problem, and show 
that it is approximately that resulting from a single dipole. Find the strength of 
the dipole, in terms of current and radius R, and check the value derived in the text 
using the scalar potential. 

3. Two parallel straight wires carry equal currents. Work out the magnetic 
induction due to the two together, in the two cases where the currents flow in the 
same or in opposite directions, drawing diagrams of the lines of force. 

4. Find the magnetic induction at points inside a wire carrying a current, 
assuming that the wire is straight and of circular cross section and that the current 
has constant density throughout the wire. 

6. Set up the equation for the vector potential in empty space, in cylindrical 
coordinates, finding components of A along r, 6, and z. {Hint: Set up the equations 
curl curl A = 0, div A = 0, using derivatives of the latter, to help simplify the 
terms.) 

6 . Use the result of Prob. 5 to find the vector potential of an infinitely long wire 
carrying a constant current i. From this vector potential find the value of B. 

7. As in Prob. 5, set up the equation for the vector potential in spherical polar 
coordinates. Use this result to find the vector potential of a magnetic dipole, 
comparing with the result of Prob. 1. 

8 . Two long parallel conductors of circular cross section, each of radius 6, are 
separated by a distance 2a between their axes of rotation. If they carry equal and 
opposite steady currents i, find expressions for the vector potential at all points of a 
plane (the xy plane) perpendicular to the direction of the currents. From this 
obtain the magnetic field at interior points of either conductor and set up the 
integral for the force per unit length with which they repel each other. (Use as an 
origin the point midway between the wires in the xy plane.) Carry the integration 
as far as you can. 

9. Show that, for any current distribution parallel to a fixed line (the z axis), 
the magnetic induction may be obtained from a vector potential that has only a 
2 component. From this prove that the lines of magnetic force are given by the 
equation A{q\,qi) — constant, where A is the z component of the vector potential, 
and qi, q 2 are orthogonal coordinates in a plane normal to the z axis. Apply this 
to the results of Prob. 8, and construct a field plot for this case. 



CHAPTER VI 
MAGNETIC MATERIALS 

Just as a dielectric contains electric dipoles that contribute to 
the field, so there are magnetic media that contain magnetic dipoles. 
These media are of three sorts: diamagnetic, paramagnetic, and ferro¬ 
magnetic. A diamagnetic medium contains no permanent dipoles, 
but only dipoles that are induced by an external field. The atoms 
of a substance contain electrons that are free in a sense to move about 
inside the atom, somewhat like the charge in a perfect conductor. 
We have used this property of the electrons in describing the electric 
polarization of the atom in an electric field. Similarly in a region 
where the magnetic induction changes with time, currents are induced 
in the atom, following the general law of electromagnetic induction 
which we shall take up in the next chapter. These currents circulate 
about the direction of the magnetic induction, and thus produce 
magnetic moments, which prove to be in such a direction that they 
oppose the field already present. A medium that possesses only this 
dipole moment opposite to B is called a “ diamagnetic medium.” 

Some media contain permanent dipoles. In an unmagnetized 
body, the dipoles are oriented at random, just as permanent electric 
dipoles are oriented at random in the absence of an external field, 
but under the action of an external field the dipoles are oriented, result¬ 
ing in a dipole moment which, as in the corresponding electric case 
that we discussed in Chap. IV, is proportional to the external field, 
and inversely proportional to the absolute temperature. This effect 
is called “ paramagnetism.” When it exists, it is usually great enough 
to mask the diamagnetism which is always present, and which has the 
opposite sign. The permanent magnetic moments responsible for 
paramagnetism arise in two ways. In the first place, the theory of 
atomic structure shows that in many cases there is a permanent circu¬ 
lating current of electrons in the atoms, resulting in a magnetic dipole, 
as shown in Chap. V. Such circulating currents were first postulated 
by Ampere, and are often called u Amperian currents,” but the explana¬ 
tion of their magnitude and nature was first given by the quantum 
theory. The diamagnetic atoms are those in which some electrons 
are circulating in one direction, some in the other, in such a way that 
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their dipole moments cancel. Secondly, the quantum theory shows 
that an individual electron possesses a magnetic moment, which is 
just as characteristic a property as its electric charge. In certain 
ways this moment can be ascribed to a rotation of the charge about an 
axis, like the rotation of the earth about its axis, resulting thus in a 
circulating current; but this simple picture of it cannot be entirely 
justified. In any case, any atom that has unbalanced moments of the 
spinning electrons will be paramagnetic; but many atoms that contain 
even numbers of electrons can have equal numbers of electrons oriented 
in opposite directions, so that the net magnetic moment is zero, and 
the atoms are diamagnetic, both orbital and spin magnetic moments 
canceling. 

The final type of medium, the ferromagnetic medium, is really 
an extreme case of paramagnetism. If the permanent dipoles, gen¬ 
erally those resulting from electron spin, are very close together 
in the medium, there proves to be an effect, explainable only on the 
quantum theory, and called “exchange,” which results in a strong 
tendency for the spins of adjacent atoms or molecules to line up 
parallel to each other, even in the absence of an external field. Such a 
parallel orientation can extend? in an unmagnetized body, over volumes 
of a considerable scale on an atomic order of magnitude, though a 
small volume by ordinary standards. Such a volume is called a 
“domain,” and an ordinary unmagnetized ferromagnetic body con¬ 
sists of many domains, each with a strong permanent moment, but 
oriented in different directions. In the presence of an external mag¬ 
netic field, the domains change the orientation of their permanent 
moments, lining them up with the external field, until finally with a 
very large external field the moment reaches a limit when all moments 
are parallel. This limit is called the “saturation moment.” Revers¬ 
ing the field reverses the moments, but there is an effect similar to 
friction, hindering this reorientation, so that, by the time the external 
field is reduced to zero, there can still be a considerable moment. 
This is the origin of permanent magnetism. If the external field is 
reversed alternately between one direction and then the other, the 
moment lags behind the field, resulting in the phenomenon of hysteresis. 

These properties of ferromagnetic bodies are very complicated, 
when one tries to investigate them in detail, in contrast to the diamag¬ 
netic and paramagnetic bodies, in which the moment is proportional 
to the field. The ferromagnetic effect decreases with temperature, the 
individual domains losing their moments at a critical temperature 
called the “Curie temperature.” Above that temperature the body 
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becomes paramagnetic, but the moment, instead of being proportional 
to 1/T, where T is the absolute temperature, is proportional to 
1 /{T — 0), where 6 is the Curie temperature. The physical explana¬ 
tion of this decrease of ferromagnetism as the temperature increases 
is that the tendency toward orientation of the magnetic moments which 
lines them up at low temperatures is opposed by thermal agitation at 
high temperatures. 

1. The Magnetization Vector. —In Sec. 4, Chap. V, we defined the 
dipole moment of a magnetic dipole: if a current i circulates in a loop 
of area A, the moment is iA. In a magnetic medium, we shall have 
dipoles distributed through the volume of the material, and we shall 
define the magnetization vector M as the vector sum of the dipole 
moments in unit volume. In a diamagnetic or paramagnetic medium, 
M will be proportional to B, the constant of proportionality being 
negative for* diamagnetism, positive for paramagnetism; in a ferro¬ 
magnetic medium, the relationship between M and B will be much 
more complicated. 

We now observe that the existence of a magnetization vector 
within a medium implies the existence of currents. Suppose as in 
Fig. 19 that we have a small rectangular 
volume, with magnetization along one of 
the axes. If there is uniform magneti¬ 
zation within the volume, we may replace 
the magnetic effect of the volume by a 
current circulating about its faces, as 
shown. If the area normal to the mag¬ 
netization is A, the height h, then the 
total dipole moment is MAh. This 
would bo produced by a current Mh cir- F,a ' 19 '— M “ enot “ ation 8urfaoe 
culatmg about an area A . In other 

words, we must assume a surface current at the surface of the mag¬ 
netized volume, numerically equal to the magnetization. We may 
indicate the direction as well as the magnitude of this surface current 
density by the vector equation 

Surface current = M X n, (1.1) 

where n is the outer normal. This is the surface current that appears 
at a surface of discontinuity between a region with magnetization M, 
and an unmagnetized region. 

It is clear that, more generally, there will be a surface current 
(M' — M") X n at a surface of discontinuity between a region 
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where the magnetization is M', and one where it is M", n being the 
normal pointing from the region M' to that where the magnetization 
is M". Passing to the limit, there will be a current flowing throughout 
a volume, when the magnetization varies continuously from point to 
point within a medium. Let us find the resulting current density. 
In Fig. 20, we show two adjacent volumes, the upper one being 
displaced a distance dy along the y axis from the lower one. Let 



Fig. 20. —Volume magnetization cur¬ 
rent. 

with a:; by a similar argument 1 
Thus combining terms we have 


the magnetic moments, in the x 
direction, be M" = M x (y + dy) } 
M' = M x (y), respectively. Then, 
considering the contour drawn, the 
current flowing in it can be written 
as J' g dxdy , where J' is the current 
density. Thus by the result above 
we have 

[M x {y + dy) 

— M x (y)] dx = —J' x dx dy. 

Passing to the limit as dx, dy become 
small, this becomes 

T , _ 8M X 

V Z A 

dy 

We should also have a component of 
J' in the z direction if M y varied 
> should find a contribution dM v /dx. 


j, _ dM v dM x 
dx dy * 

But this is merely the z component of the vector equation 

J' = curl M, (1.2) 

which is the general relation giving the volume density resulting 
from a variation of M from point to point, a relation of which (1.1) 
is the limiting form at a surface of discontinuity between a magnetized 
and an unmagnetized region. 

2. The Magnetic Field.—The current density J' which we have 
■written in (1.2) is a current density that is necessarily present when¬ 
ever we have a magnetized medium. In our introductory remarks 
we have seen its interpretation, in terms of the flow of electrons within 
atoms, and of electron spins. It is quite different from the current 
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^^nsity J met for instance in Chap. V, Sec. 5, which was a current 
density resulting from the ordinary flow of current in conductors. 
'The situation is similar to that which we met in Chap. IV, Sec. 1, 
Where we found two types of charge: what we called the “real charge,” 
^e charge we placed on the plates of condensers and other conductors, 
^•txd the “polarized charge,” which automatically appeared in dielec¬ 
trics. There we denoted the polarized charge by a prime, and here 
similarly we are denoting the current density J' resulting from magnetic 
Polarization by a prime. We may, if we choose, refer to J as the real 
Current (that which flows in ordinary conductors), and to J' as the 
Polarized current, or the magnetization current. 

It now follows from our whole train of argument that the magne¬ 
tization current is just as effective in producing a magnetic induction 
S-s a real current. In other words, in Ampere’s law, Eq. (5.2) of 
Ohap. V, we must replace J, the current density, by J + J', the sum of 
^oal and polarized current density. . Hence we may rewrite that law 

curl B = MJ + JO = juo(J + curl M), 
from which we deduce at once the relation 



The quantity B/Vo — M which appears in (2.1) is a sufficiently 
important quantity so that we give it a name and a symbol: we shall 
call it the “magnetic field,” or “magnetic intensity,” and denote it 
fc>y H. The process of introducing it is essentially similar to that 
nsed in Chap. IV, Sec. 1, in introducing the electric displacement D. 
"We may rewrite the definition of H in the form 


B = M o(H + M). 

In terms of it, we then have 

(2.2) 

curl H = J 

(2.3) 

as the equivalent of (2.1). 



In the diamagnetic or paramagnetic medium, we have already 
stated that M is proportional to B. Thus both these vectors are also 
proportional to H, and we may write 

B = juH = K m ju oH, (2.4) 

where the dimensionless quantity K m is generally referred to as the 
* 'permeability.” The ratio of magnetization to magnetic intensity 
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is generally called the “magnetic susceptibility ”: 

M = XmH, K m = 1 H" Xm- (2.5) 

These equations are closely analogous to Eqs. (2.3) of Chap. IV, 
though no does not appear in the same way in which €o did in the 
earlier case. Unlike the case of dielectrics, Xm can be negative (for 
diamagnetic media) as well as positive (for paramagnetic media). 
The magnitudes are also quite different: for diamagnetic and para¬ 
magnetic media, K m differs by very small amounts from unity, and 
Xm is very small compared with unity, whereas the dielectric constant 
K e can be quite large compared with unity. For ferromagnetic media, 
where M is not proportional to B or H, we can define a permeability 
by (2.4), but it will be a function of H, rather than a constant. It is 
more convenient for such media simply to give experimental curves 
for B as a function of H, as is done in the conventional hysteresis 
curves, which we shall mention in a later section. 

The magnetization vector measures the density of magnetic 
dipoles per unit volume, just as the polarization P measures the 
density of electric dipoles. In Chap. IV, we found that P was associ¬ 
ated with a volume and surface-charge density: by Eq. (1.2) of that 
chapter we found that the volume density p' was given by div P = — p', 
and in Sec. 3 of that chapter we found that at a surface of a dielectric 
a surface-charge density appeared, equal numerically to the component 
of P along the outer normal. In an analogous way we may define 
a volume density of magnetic poles, equal to — div M, and a surface 
density equal to the component of M along the outer normal to a 
magnetized body. Remembering that div B = 0, we see that 
div H = — div M. Thus lines of H will diverge outward from regions 
where the density of poles is positive, or from north poles, and will 
converge to regions of negative density, or south poles. Lines of 
B, on the other hand, are continuous, on account of the equation 
div B = 0. We shall shortly see an example of this behavior of 
magnetic lines, in the case of the magnetized sphere. It should be 
understood that, although we have an analogy between electric charge 
and magnetic pole strength, this analogy is far from complete, since 
electric charges can be separated from each other, whereas magnetic 
poles can exist only in dipoles, or in polarized bodies. 

3. Magnetostatic Problems Involving Magnetic Media. —We have 
found two fundamental equations dealing with the magnetic field, 

div B = 0, curl H = J, (3.1) 

where the first is Eq. (3.2) of Chap. V, and the second (2.3). These, 
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taken together with (2.4), or the corresponding relation for ferro¬ 
magnetic media given by the hysteresis curve, give enough information 
so that we can solve magnetostatic problems involving magnetic 
media as well as currents. Problems in diamagnetism and para¬ 
magnetism can be solved as easily as those involving dielectrics, and 
by essentially the same methods. We generally have a number of 
media, each of uniform permeability, with surfaces of separation 
between. Generally also we do not have any currents J flowing in 
the magnetic media. Thus inside these media we have div B = 0, 
from which, since B = £iH, and /x is constant, we have div H = 0; 
furthermore curl H = 0; H can then be derived from a scalar poten¬ 
tial, which satisfies Laplace's equation because of div H — 0. Thus 
the fields inside the various media can be found by familiar methods, 
and we have only to satisfy boundary conditions at the surfaces of 
discontinuity, and to ensure that our fields behave properly at the 
currents J. The boundary conditions, by methods already used, are 

From div B = 0: normal component of B continuous 

From curl H = J : tangential component of H continuous at 

a surface carrying no current; if there is 
a surface current, surface-current density 
= (discontinuity in H) X n. (3.2) 

As a simple example, let us consider the problem of a solenoid 
filled with a diamagnetic or paramagnetic material. As in Chap. V, 
Sec. 5, we have a solenoid with n turns per unit length, each carrying 
a current i. The solenoid is filled with a material of permeability 
K m . The surface-current density is ni; thus the discontinuity in 
tangential component of H between the inside and outside of the 
solenoid is ni. Since H is zero outside, we see that inside H = ni } and 
B = uni. In other words, the value of H within an infinite solenoid 
is the same, independent of the medium within it, so long as there 
are the same number of ampere-turns in the winding, but the value 
of B is proportional to the permeability of the medium. This problem 
is one that shows us the proper units to use for measuring H: since 
H = ni, we measure magnetic intensity in ampere-turns per meter. 
A problem similar to the solenoid, and conveniently realizable experi¬ 
mentally, is the toroid, a ring-shaped piece of magnetic material, 
wound round and round with windings, so that the magnetic lines 
circulate within the ring. Simple application of Ampere's law shows 
that, just as in the solenoid, the value of H is equal to the number of 
ampere-turns per meter in the winding, provided that the cross section 
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is small compared with the length of the toroid, and the value of B 
is n times as great. 

Another simple problem is the field produced by any distribution 
of currents in an infinite space filled with a medium of permeability 
Km. Because of the equation curl H = J, the magnetic intensity H 
produced by the currents will be independent of **». Thus B will be 
Km times as great as in the corresponding problem in empty space. 
Since the force on another current element is proportional to B, we 
see that the force between two current elements immersed in a medium 
of permeability Km is Km times as great as if the same current elements 
were in empty space. We may take account of this, in Eq. (1.4), 
Chap. Y, for the force between two current elements, by replacing the 
quantity ju 0 which appears in that expression by fx. We must remem¬ 
ber, however, just as in the corresponding case of dielectrics, that the 
general case of a number of different magnetic media with surfaces of 
separation between them is a complicated problem, and that it is by 
no means true that the value of B at any point is merely K m times as 
great as it would be if no magnetic media were present. 

4. Uniformly Magnetized Sphere in an External Field.—Just as in 
the corresponding dielectric problem taken up in Sec. 6, Chap. IV, 
the problem of a uniformly magnetized sphere in an external field is 
one that can be solved exactly, and that is of considerable physical 
importance. Let us first consider a sphere of radius It with a uniform 
magnetization M along the axis of spherical coordinates, without an 
external field (such as we might have from a permanent magnet). 
We can later superpose a constant external field. We can easily find 
that the field of this sphere is as follows: 

[ 2 

-, B = -z juoM, along axis 

I o 

(MR* \ 

grad f cos 0 1 B = M oH. (4.1) 

We can verify this solution as follows: As in Sec. 6, Chap. IV, B and 
H outside the sphere are the field of a dipole, of moment %irR z M 
(where we use Chap. Y, Sec. 4, to get the moment from the scalar 
potential). This is simply the moment of a sphere of radius R , with 
a constant density of magnetization M. The field outside the sphere 
is then a gradient of a scalar potential, so that it is a solution of the 
equations governing B and H in empty space. Inside the sphere, the 
constant values of B and H are likewise solutions of these equations. 
We can find the normal components of B, and tangential components 


Inside the sphere, H = — 
Outside the sphere, H = 
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of H, inside and outside the sphere, at radius R, and show them, to be 
continuous. Finally, inside the sphere we have the relation 

B = mo(H + M), 

satisfying Eq. (2.2). 

The lines of B inside and outside the sphere are then as given in 
Fig. 21. Because of the relation div B = 0, the lines are closed, the 
normal flux being continuous at the surface. On the other hand. 



Fig. 21.—Lines of force of uniformly magnetized sphere. 


although the lines of H outside the sphere are like those of B, the lines 
of H inside run in the opposite direction. Thus lines of H originate 
from the right face of the sphere (the north pole of the magnet), and 
terminate at the left face (the south pole). If now the magnet is in a 
constant external field Ho, whose corresponding value of B is Bo, 
pointing along the axis, and equal to moHo, we need only add these 
constant values Ho and Bo to the solution of (4.1), both inside and 
outside the sphere, to get the complete solutions. We thus find that 
inside a magnetized sphere placed in an external field Ho, the magnetic 
field is only H 0 — M/3. This effect, by which there is a term —M/3 
subtracted from Ho because of the magnetization, is called the “ demag¬ 
netizing effect,” and the factor lij, which depends on the geometry of 
the sphere, is called the “demagnetizing factor.” 

We can now use these relations, together with a hysteresis curve, 
to investigate the magnetic moment that a ferromagnetic sphere 
would acquire in a given external field. If B , H, represent the values 
within the sphere, in an external field Ho, we then have 
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H - H 0 - y’ 


Ho + 


= -2H + 3H 0 . (4.2) 


H 0 =O 


H n = const. 


If we know the relation between B and H characteristic of the material 
(that is, the hysteresis curve), we can then find the intersection 
of this curve with the curve B/no = —2 H + 3H Q , from (4.2). This 
intersection will give us the values of B and H inside the sphere in 
any external field. From those, we can get M, and hence the mag- 

. netic moment of the sphere. An 

' Mo example of a hysteresis curve, with 

the straight lines (4.2) superposed, 
is given in Fig. 22. Conversely, by 
a measurement of the magnetic mo¬ 
ment, we can work backward and 
find the hysteresis curve. 

As an example of this method, 
we may ask what are the values of 
B and H within the sphere, if we 
first magnetize to saturation, and 
then remove the external field. If 
Ho = 0, we have B/fxo = —2 H, 
and the values of B and H are de¬ 
termined by the intersection shown 
in the figure. We note that this 

Fig. 22.—Hysteresis curve. 0 

has a much smaller value of B 
(and hence a much smaller magnetic moment) than if the demagnetiz¬ 
ing factor were smaller. For suppose the factor is L , so that 

H = Ho - LM, 



I 


B/jx 0 = H 0 + (1 — L)M. Then we have 



Thus, as L approaches zero, the slope of the straight line corresponding 
to Ho = 0 becomes negatively infinite, or the line coincides with the 
axis of ordinates in the BH diagram, so that the B inside the perma¬ 
nently magnetized object becomes much greater. 

This situation can be approached in a practical case. It turns 
out that we can obtain an exact solution of the problem of a uniformly 
magnetized body, similar to (4.1), not only for the sphere, but for the 
ellipsoid; though the solutions are much more complicated, involving 
elliptic integrals. When the ellipsoid is infinitely elongated along the 
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axis, the magnet approaches an infinitely thin bar magnet; in this case 
the demagnetizing factor approaches zero. The opposite limit is the 
flattened disk-shaped ellipsoid, in which the demagnetizing factor 
approaches unity, so that B/i u 0 approaches Ho, and in the absence 
of an external field there is no B within the magnet. Because of the 
possibility of getting exact solutions, accurate experiments on the 
magnetic behavior of magnetic materials are generally made on 
samples of an ellipsoidal shape. As a final example of similar argu¬ 
ments, we can find the values of B and H within ellipsoidal cavities 
in a magnetized medium, as we did in Sec. 7, Chap. IV, for the cor¬ 
responding dielectric case. We find that with a long thin cavity 
the value of H equals the value Ho in the medium at a large distance 
from the cavity, whereas with a disk-shaped cavity the value of H 
equals the value of Bo in the medium at a large distance, divided by no. 

6. Magnetomotive Force.—Practical magnets are not made in the 
spherical or ellipsoidal shapes discussed in the preceding section, but 
rather in the shape of a closed ring of 
some shape with an air gap, and the 
external field is supplied by windings 
surrounding the ring, as shown in Fig. 

23. In this case, if the permeability of 
the magnetic medium is high, the lines 
of force will mostly flow through the 
magnetic material except near the gap, 
where the lines of force will partly cross 
the gap where we are trying to produce 
a high field, and will partly leak around 
the sides of that gap. It is almost im¬ 
possible to calculate accurately the 
form of the resulting field; we do not 
have the simplifying feature that the 
field is constant within the magnetic medium, as we do with the ellip¬ 
soidal or spherical magnet. We can, however, make certain assump¬ 
tions that allow us to use a certain amount of theory in discussing such 
a magnet. 

Let us assume that the leakage is concentrated in a relatively short 
part of the magnet, near the poles. Then, if the cross section is 
uniform through most of the length, we note that the flux of B must 
be the same through most cross sections (because of the lack &f leak¬ 
age) and hence B must be constant along the length of the magnet 
(because of the uniform cross section). Let this constant value of B 



magnet. 



76 


ELECTROMAGNETISM 


[Chap. VI 


within the magnet be B 1 , and the corresponding value of H be Hi. 
Furthermore let the length of the magnetic material, along a mean 
circumference, be h. In the gap, if there were no leakage, the value 
of B would be the same as inside the magnet; because of leakage, 
however, it will be smaller. Suppose the value of B in the gap is then 
Bo — ctB h where a is a factor less than unity, which is sometimes called 
the “leakage factor.” Then the value of H in the gap will be 


TT B o aB i 

Mo Mo 

Let the length of the gap be l 0 . Now we apply Ampere's law to a 
contour through the middle of the magnet. From (2.3), Ampere's 
law may be stated in the form 


/H • ds = ni, ( 5 . 1 ) 

where there are n turns of wire, each carrying z amp, threading the 
contour. We then have approximately 

H olo + H x li — ni. (5.2) 

We may then rewrite (5.2) in the form 


Bi _ h tt , ni 

—-r rz i “i— tt- 

Mo odo ato 


(5.3) 


This equation, relating Bi and Hi, is essentially equivalent to 

(4.3) . In place of the factor (1 /L — 1) multiplying H we have 
h/odo) in place of H 0 we have the term in ni. It is customary to 
call the number ni, which is the number of ampere-turns, the “mag¬ 
netomotive force” (mmf), by analogy with the electromotive force 
JE • ds met in the electric field. As in the preceding section, in a plot 
of Bi versus H x a line of constant mmf will be a straight line of negative 
slope. In an actual case, li will ordinarily be large compared with 
Z 0 , and a will be rather small. Thus the case resembles that of a small 
demagnetizing factor, which we have discussed in connection with 

(4.3) , so that the value of Bi inside the magnet will be large if the 
min f is removed, as in a permanent magnet. 

We may note a number of interesting consequences of our theory. 
If the magnet is operated as an electromagnet, with large mmf, the 
value of Bi will be inversely proportional to l Q ; that is, for a large gap, 
we shall have to have a correspondingly high mmf to get saturation 
of the ferromagnetic material, which we presumably want to do to 
operate the magnet efficiently. A small value of a, on the contrary, 
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allows us to saturate the material with a smaller mmf, but it gives 
correspondingly smaller fields in the gap. If the magnet is operated 
as a permanent magnet, the field in the gap, which we are primarily 
interested in, is given by 

H«-Hi, 

so that, if the material is used with a given value of jBi and of Hi, the 
field in the gap will be inversely proportional to the length of the gap, 
for constant magnet length h. It can be shown, from arguments of 
energy that we are not yet prepared to understand, that a permanent 
magnet can be made with least material if it is made with those par¬ 
ticular values of jBi and Hi which make the product B 1 H 1 a maximum; 
if this condition is applied, the dimensions of a magnet designed 
for a given field and given gap are determined. 

Problems 

1. An electron of charge e, mass ra, rotates in a circle of radius r with velocity v. 
Using classical methods, show that the magnetic moment is proportional to the 
angular momentum. If the angular momentum is h/2 tt, where h is Planck’s 
constant, find the magnetic moment, in mks units. If we make a simple model of 
a ferromagnetic substance by putting one such moment at each point of a simple 
cubic lattice of lattice spacing 3 A, find the magnetization per unit volume, M, 
when all moments are parallel. 

2. A sphere of radius a carrying a uniform surface-charge density <r is rotated 
about a diameter with constant angular velocity co. Calculate the magnitude and 
direction of the magnetic intensity H at the center of the sphere. Show that this 
value of H is the same at all points inside the sphere. 

3. A long circular cylinder of permeability is placed in a uniform external 
magnetic field that is perpendicular to the generators of the cylinder, resulting in a 
uniform magnetization of the cylinder. Find expressions for B and H inside and 
outside the cylinder, and the demagnetizing factor. 

4 . A long straight wire carrying a steady current i is placed parallel to and at a 
distance a from a large thick plane slab of permeability K m . Show that the field 
inside the slab is that which would be produced by a current 2i/(<cm + 1) in the 
same wire embedded in an infinite medium of permeability n m . Show further 
that the field in the space in front of the slab is that produced by a current i and 
another current £(«»» •— 1)/(% -hi) parallel to and at a distance a, behind the 
interface, both in empty space. What is the force per unit length acting on the 
current-carrying wire? 

6. Discuss the magnetic field of permanent magnets, showing that H may be 
obtained as the negative gradient of a single-valued scalar potential which 
satisfies Poisson’s equation VN 1 ~ — p' > with p' — div H. If one defines magnetic- 
pole strength as p = J p' dv, show that two concentrated poles exert mechanical 

forces on each other given by F — — if is the permeability of the medium 

in which they are embedded. 



CHAPTER VII 


ELECTROMAGNETIC INDUCTION 
AND MAXWELL’S EQUATIONS 

The history of electromagnetism has shown as its most conspicu¬ 
ous feature the gradual discovery of interconnections among problems 
that were at first supposed to be separated. The two oldest fields 
were electrostatics, and the magnetism of permanent magnets and of 
ferromagnetic bodies. Early in the nineteenth century Oersted and 
others demonstrated the magnetic effects of continuous currents, 
bringing together the study of the electric current, which developed 
with the discovery of various forms of batteries in the eighteenth 
century, and the study of magnetism. Faraday, soon after this work, 
began looking for a converse effect. He reasoned that, if currents 
could produce magnetism, magnets should be able to produce currents. 
His first idea was simple, but wrong. He wound two coils of wire 
together, but insulated from each other, and planned to pass current 
through one of them, converting it into an electromagnet. He hoped 
that this magnet, with its lines of induction threading through the 
other coil, would cause a continuous current to flow through that coil, 
just as a continuous current produces a continuous magnetic field. 
His experiment did not show such a current; even though his battery 
was powerful enough so that his primary coil was heated red-hot, still 
no current flowed in the secondary. But he was a good enough 
observer to notice that, though there was no steady current in the 
secondary, there was nevertheless a transient current when the current 
was started in the primary, and a transient in the opposite direction 
when the primary current was interrupted. This suggested to him 
that the effect he was seeking really existed, but that the induced 
current for which he was searching was proportional, not to the mag¬ 
netic flux itself, but to its time rate of change. This law of electro¬ 
magnetic induction, which bears Faraday’s name, is the foundation 
of the study of electromagnetic theory. 

Faraday thought, not in mathematical language, but in terms of 
lines of force. We have already seen how this concept, leading to 
the field theory of electrostatics and magnetostatics, allows us to 
formulate problems involving dielectrics, conductors, and magnetic 
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materials in a form that is much more powerful than any method based 
on the concept of action at a distance. In electromagnetic induction 
as well the idea of lines of force, and of flux, proved to be of the great¬ 
est value, the statement of Faraday’s law involving the time rate 
of change of magnetic flux through the coil. The proper mathematical 
formulation of the methods of lines of force did not come, however, 
until the work of Maxwell, a number of years later. Maxwell brought 
together the various laws of electrostatics and magnetostatics which 
we have already discussed, and correlated them with Faraday’s law, 
expressed in a similar differential or vector form. He was able to 
see that the system of laws so built up was mathematically inconsistent, 
and that to make it consistent he had to introduce a certain quantity 
called the “displacement current,” which was in principle susceptible of 
experimental observation, but which had not been observed at the 
time. By combining these laws, he set up the equations known as 
Maxwell’s equations, which have been the foundation of the theory 
of electromagnetism ever since. These equations did not merely 
describe the phenomena known at Maxwell’s time; they predicted 
the existence of electromagnetic waves, which later proved to be 
identical with light waves, thus enormously extending the range of 
the theory. These advances we shall take up in later chapters; we 
now consider the detailed nature of the law of electromagnetic induc¬ 
tion, and its role in electromagnetic theory. 

1. The Law of Electromagnetic Induction.—It is found experi¬ 
mentally that, when the flux of magnetic induction B through a wire or 
other conducting circuit changes with time, a current flows in the 
circuit. The law of electromagnetic induction can be stated most . 
simply, not in terms of the current set up in the circuit, but in terms 
of the emf leading to the current. By definition, the emf around a 
circuit equals the total work done, both by electric and magnetic 
forces and by any other sort of forces, such as those concerned in 
chemical processes, per unit charge, in carrying a charge around the 
circuit. Faraday’s law states that the emf induced in a circuit equals 
the negative of the rate of increase of flux of B through the circuit. 
The flux of B is by definition the surface integral of the normal com¬ 
ponent of B over a surface whose perimeter is the circuit in question. 
We may then write Faraday’s law in integral form in the following 


way: 



E • ds 



B • n da. 


(i.i) 


The term on the left is the emf; in the case of electromagnetic indue- 
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tion, the force acting on the charge is, as we shall soon see, an electric 
force. The term on the right is the time rate of decrease of magnetic 
induction through the circuit. The surface integral is to be extended 
over any surface spanning the circuit. Because div B = 0, this 
integral will be the same over any such surface; for if we take two such 
surfaces, the total flux of B out of the volume enclosed by the two 
surfaces must be zero, so that the flux through each surface must 
be the same. In (1.1), the emf is to be expressed in volts, and the 
flux in webers; we note that 1 volt must be equivalent to 1 weber/sec. 

It is a familiar fact associated with electromagnetic induction 
that there are several ways in which the magnetic flux through a 
circuit can change. First, the circuit itself may move from one part 
of the magnetic field to another. This is the process used in the 
ordinary electromagnetic generator, or dynamo. Secondly, the circuit 
may be stationary, but the magnetic induction may be a function of 
time, either because it is produced by currents that are varying with 
time, or because magnetic materials such as permanent magnets are 
moving. If B is changing with time, but the circuit is fixed, we may 
take the time derivative in (1.1) inside the integral sign. Also we 
may transform the left side of (1.1) into a surface integral, by using 
Stokes’s theorem. Thus we have 



curl E • n da 



n da. 


This equation must hold for a circuit of any shape, and with any 
direction of the normal n; thus the integrands on the two sides must 
. be equal, or we have 

curl E = - (1.2) 


This equation is the differential form of Faraday’s law, and as we 
shall see later it is one of the fundamental equations of electromagnetic 
theory, one of Maxwell’s equations. 

2. Self- and Mutual Induction.—If a current i\ flows in a circuit, 
it produces a magnetic field Bi proportional to i\ at all points of 
space. If ii changes slowly, the magnetic field is very closely the 
same at any instant as it would be if the current had a constant value 
equal to its instantaneous value; such a condition is known as a 
* ‘quasistationary state.” There will then be a time rate of change 
of Bx proportional to the time rate of change of i\. The flux of Bi 
through the circuit itself will then vary as ii does, so that we shall 
have an emf in this circuit proportional to dii/dt, which we may 
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write as 

, r dii 

emfi — —L ~T 7 > 
at 

where L is the coefficient of self-induction. Similarly the flux of Bi 
through another circuit will vary as i\ does, so that the emf in a 
second circuit is 

emf 2 — —M 

Cut 

where M is the coefficient of mutual induction. 

We can easily set up general expressions for the coefficients of 
self- and mutual induction by use of Eq. (6.5) of Chap. V, expressing 
the contribution to the vector potential resulting from a current 
element i ds. From the discussion above, L equals the flux of B 
through a circuit per unit current flowing in it, and M the flux of 
B through a circuit per unit current flowing in another circuit. But 
remembering that B = curl A, we have 

/B • n da — / curl A • n da = JA • ds, 

in which the last line integral follows from Stokes's theorem. Using 
Eq. (6.5), Chap. V, which is 

dA = £2 —’ (2.1) 

47r r 

and setting i — 1, we have 

LmM = Zj / ^ 7 ^" ( 2 - 2 > 

Here ds and ds' are two elements of length, and r is the distance 
between them. The integral is a double integral, since each of the 
elements of length is integrated around the contour. For L , both 
ds and ds' are to be integrated around the single circuit 1; for M , 
ds is to be integrated around circuit 1, ds' around circuit 2. Because 
of the symmetrical formula, we see that the mutual inductance 
between two circuits is the same, no matter which of them we regard 
as the source of the current, and which as the one in which the emf 
is being induced. 

In many special cases, we can, make direct calculations of the 
coefficients of self- and mutual inductance, by simpler methods than 
the use of (2.2), which in practice usually involves complicated 
integrations. For, if we can find the magnetic flux resulting from 
the system of currents in the circuits, we can often integrate these 
fluxes over the circuits directly. Thus, for instance, consider the 
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self-inductance per unit length, of two parallel wires of radius a , at 
distance D apart between their centers, shown by Fig. 7, which was 
used earlier in computing the capacity of the same parallel wires. 
To find the magnetic field produced by a unit current flowing up 
through the right-hand wire, down through the left-hand wire, we 
first consider unit current flowing up through an infinitely thin 
conductor located at point 2 of the figure 1 , and an equal current 
flowing down through a linear conductor at point 1. We note an 
analogy between the electrostatic and magnetic problems: the scalar 
potential <p in the electrostatic case, for a distribution of charge 
along these lines, is given by Eq. (3.5) of Chap. I, and the vector 
potential A in the magnetic case for a distribution of current is given 
by Eq. (6.4) of Chap. V. These formulas are the same, except for 
two differences: the formula for A has no in the numerator in place of 
€o in the denominator, and the formula for A involves current density 
in place of charge density, and. hence results in a vector. 

Since every current element is in the same direction in our present 
problem, A in this case will also be in the same direction, normal to 
the plane of the paper in the figure. If k is a unit vector along this 
direction, and A is the magnitude of A, we have A = k A. To take 
the curl of A, and find B, we use the rule of vector analysis that 

curl /F = / curl F + grad / X F, 

where / is a scalar, F a vector. Letting / stand for A, F for k, and 
remembering that F is then a constant vector, we have 

B = curl A = grad A X k. 

If on the other hand we had had unit charge along the lines instead 
of unit current, the electric field would have been 

E = — grad <p , 

where as we have just seen <p equals A/eon o- Thus we see that the 13 
resulting from unit current, and the E resulting from unit charge, arc 
related by the equation 

B = eoMo(k X E). 

The magnitudes, in other words, are proportional, and the vectors are 
at right angles to each other, so that the lines of magnetic force 
are identical with the electric equipotentials in the problem of Fig. 7. 
In particular, the circular boundaries of the wires are lines of magnetic 
force. 

Now, to integrate the normal component of B over a surface 
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spanning the circuit, we may integrate over a plane surface including 
the two lines 1 and 2. The normal component of B in this case is just 
eoMo times the tangential component of E in the corresponding elec¬ 
trical case. Thus the surface integral of the normal component of B 
over unit, length of the transmission line, integrating between the 
boundaries of the two cylinders, is eo/xo times the corresponding differ¬ 
ence of potential in the electrical case. The inductance is the flux 
of B per unit current; thus it is €ojuo times the difference of potential 
per unit charge, or is eo/xo divided by the electrostatic capacity per 
unit length. Using Eq. (2.1) of Chap. II for this capacity, we then 
find that the inductance per unit length is 


L = ~ cosh ~‘ Ya 


7r 


(2.3) 


In this calculation, we have neglected magnetic flux through the 
interior of the wires. This flux would be small in any case, and 
could almost be neglected. If the wires, instead of being solid, were 
hollow, so that the current flowed in a thin shell on the surface, and 
there was no flux inside, we should have just the condition for which 
(2.3) was correct. The method of calculation we have used here, 
leading to a simple relationship between problems in electrostatics 
and magnetism, can often be used, and can be justified in a much 
more, general case than the one we have considered here. 

3. The Displacement Current.—In the course of our work, we have 
derived four fundamental electromagnetic equations, (1.3) of Chap. IV, 
(3.2) of Chap. V, (3.1) of Chap. VI, and (1.2) of the present chapter, or 


div D 
div B 

curl E 

curl H 


P, 

0 , 


= J. 


eB 

dt’ 


(3.1) 


These are almost Maxwell’s equations, but there is a difficulty with 
the last of them. We have derived it from Ampere’s law, on the 
basis of steady closed currents, and for this case it is correct. The 
difficulty occurs when we try to apply the equation to nonstationary 
Suppose we have a current flowing in an open circuit, as in the 


cases. 


discharge of a condenser. The current starts at the positively charged 
plate, whose charge diminishes as the current flows to the negatively, 
charged plate and annuls the charge there. Thus we can look upon 
the condenser plates as sources or sinks of current. Now, if we take 
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the divergence of the last equation, we have 

div curl H = div J, 

and, since the divergence of any curl is zero, we find that div J equals 
zero, which means that the current is always closed and there are no 
sources or sinks. Thus we are led to a contradiction. 

Maxwell concluded from this that the last equation of (3.1) must 
be incomplete, and that to the term J must be added another term, 
such that the sum of the two had no divergence. We can easily find 
what this term must be, from the equation of continuity for the flow of 
current. The divergence of J measures the flux outward over the 
surface of unit volume. If there is current flowing outward, the charge 
within unit volume must be decreasing, the flux equaling the rate of 
decrease of charge in unit volume. Thus we have 

div J = - ff (3.2) 

We may rewrite this equation of continuity, using the equation 
div D = p, and obtaining 

div (j + it) ■ °- 

In other words, although div J is not zero, the divergence of the 
quantity J + dD/dt is always zero, so that this quantity can mathe¬ 
matically be placed equal to a curl. Maxwell made the assumption 
that the last equation of (3.1) should properly be replaced by 

curl H = J + ( 3 . 3 ) 

The last term dD /dt is called the “ displacement current, ” to distin¬ 
guish it from J, the conduction current. By adding this term to 
Ampere’s law, Maxwell assumed that a time rate of change of dis¬ 
placement produced a magnetic field, just as a conduction current 
does. A. test of IMaxwell J s hypothesis can be made only with very 
rapidly varying fields, in which the rate of change of D with time 
is so great that the displacement current is large compared with the 
conduction current, or so that magnetic forces produced by it are 
comparable with the electric forces due to E. We shall see later 
that these cases are those in which we have electromagnetic waves, 
whose existence is possible only because of the presence of the dis¬ 
placement current in (3.3). Their existence, then, forms a demonstra¬ 
tion of the correctness of Maxwell's hypothesis. 
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To understand the physical meaning of the displacement current 
in a simple case, consider the charging of a condenser. Current 
flows, from one plate through the wire to the other plate. If the 
current is i, this equals the rate of increase of charge on the plate. 
Suppose the plates are of area A, separation d, then the value of D 
between them is D = charge /A , and the displacement-current density 
in the region between plates is current/A. Thus the total displace¬ 
ment current is equal to the conduction current in the wire, so that the 
current is continuous through the circuit. 

4. Maxwell’s Equations.—We now can write Maxwell’s equations, 

curl E = — div B = 0, 

Ol 

curl H = J + div D = p, (4.1) 

in which we note that the divergence equations follow from the curl 
equations by taking the divergence, using the equation of continuity 
(3.2), and integrating with respect to time. These are to be supple¬ 
mented by the relations (2.1) of Chap. IV, and (2.4) of Chap. VI, 

D = eEj B — mH. (4.2) 

These are often called the “constitutive equations.” If the current 
density J obeys Ohm’s law, we often include it also in the statement 
of the constitutive equations. This law is easily stated in differential 
form by considering a small volume, having length L in the direction 
of the current, and cross-sectional area A normal to the current. We 
apply Ohm’s law in the form potential difference = iR. Here the 
potential difference is the field E times the length L of the volume, the 
current is the area times the current density J, and the resistance is 
the specific resistance times L/A. Hence we have 

E L = AJ ^ X specific resistance, 

J = <rE, (4-3) 

where <r, the specific conductivity, is the reciprocal of the specific 
resistance. The unit of conductivity is the same as that of J/E ; 
that is, its units are 1/(ohm-meter) = mhos per meter, where 1 mho 
is defined as the reciprocal of 1 ohm. It should be noted that 1 inho/m 
is Koo of 1 mho/cm, the usual unit of conductivity. Ohm’s law in 
the form (4.3) can be added to the relations (4.2). 

Maxwell’s equations, taken together with the constitutive equa- 
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tions, determine the field, when we are given the charges and currents. 
To make a complete set of dynamical principles, however, we need 
two more relations. First is the formula (1.1) of Chap. I, 

F = g(E + v X B), 

giving the force acting on a charge moving with a given speed, or 
the corresponding force acting on the charge and current in unit 
volume, 

F = P E + (J X B). (4.4) 

Secondly, we must have a law determining the motion of charge in 
terms of force acting. If the charge is in a metallic conductor obeying 
Ohm’s law, (4.3), that law provides the necessary relation. For 
electrons and ions moving in empty space, however, as in a vacuum 
or discharge tube, we must use Newton’s law, that the force equals 
the mass times the acceleration, or the time rate of change of the 
momentum. With such a law, we find the field from the charge, the 
force from the field, and the motion from the force, obtaining there¬ 
fore a complete system of dynamics. 

Maxwell’s equations, the constitutive equations, and the force 
equation, as just written, form the foundation of the whole of electro¬ 
magnetic theory, and as far as is known are exactly correct, aside 
from the corrections resulting from the quantum theory, w r hich in a 
sense change the whole formulation of the theory. They allow the 
derivation of the electromagnetic theory of light, and of electromag¬ 
netic waves in general, which we shall shortly take up. They hold 
even for particles moving nearly with the velocity of light, for which 
the theory of relativity must be used in discussing the mechanical 
part of the motion, but for which Maxwell’s methods arc still correct 
for finding the forces. In the older developments of electrical engineer¬ 
ing, the so-called “lumped-circuit theory,” it was possible to operate 
with the limiting case of slow variation with time, in which the dis¬ 
placement current could be neglected. The newer developments of 
microwaves and distributed constants, however, operate with the 
theory of electromagnetic waves, for which the displacement current 
is essential, and Maxwell’s equations as formulated in this section, 
rather than the quasistationary form of ‘them as given in (3.1), must 
be used. 

6. The Vector and Scalar Potentials. —We observe that, if B 
depends on time, curl E 0, so that there is no potential for E. The 
ordinary electrical potential is thus confined to static problems. 
Further, if J or &D/dt 0, there is no potential for B. We have 
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seen in Chap. V how a potential can be introduced for B: one uses a 
vector potential A, possible because div B = 0. That is, we let 

B = curl A. (5*1) 

We can do this even in the general case. And it proves that we can * 
use a scalar potential <p, reducing to the electrostatic potential in 
the case of a steady state, but different in other cases, by a special 
device. The relation that proves to be satisfied is that 

E = — grad <p — ^> (5.2) 

reducing to the familiar E = — grad <p when the vector potential is 
independent of time. To verify these statements, we substitute the 
expressions for E and B in Maxwell’s equations, and see if they can 
be satisfied by proper choice of A and <p. First we note that 

div B — div curl A 


is automatically zero, since the divergence of any curl is zero. Sim¬ 
ilarly, remembering that the curl of any gradient is zero, we find that 
the equation curl E = — dB /dt is automatically satisfied. 

Next we must consider the other two equations, in D and H. To 
find these quantities in terms of E and B, we must use the constitutive 
equations. The relations are simple only if e and /x are constants 
independent of position, and we consider only that case. From the 
equation curl EC — dT)/dt = J, we have, using the relation of vector 
analysis that 

curl curl A = grad div A — V 2 A, 

the result 

. V 2 A - e/i ~~ - grad (div A + < M ff) = ~aJ- 


Similarly, from the equation div D = p, with a little manipulation, 
we find 


W> 4- d 

V-«p — e/x -XT.J + 


dt 


( 


div A + <?/x 


f v\ _ 

dt) 


P 

€ 


Now let us choose A and <p subject to the condition that 


div A + e/x ^7 = 0- 
dt 


(5.3) 


Since div A is so far arbitrary, we can do this. Then the equations 
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for the potentials become 

V 2 A 

vv 

If A and <p satisfy these equations, then, as we stated before, 
the fields determined from them by (5.1) and ( 5 . 2 ) satisfy Max¬ 
well’s equations. The equations for the potentials are of the form 
called “ d’Alembert’s equation,” and as can be seen are extensions of 
Poisson’s equation, obtained by adding the time derivatives. We 
observe that, in regions where there is no charge and current density, 
the potentials satisfy the wave equation, which is the homogeneous 
equation obtained by setting the right side of d’Alembert’s equation 
equal to zero. We shall show in the next chapter that this means 
that <p and A are given by functions representing waves traveling with 
the velocity 1 /\/ep, and that the same thing is true of the fields E 
and H. 

In (5.4) we have set up equations for A and <p in te.rms of pre¬ 
scribed values of the charge and current density. Sometimes, how¬ 
ever, we wish to assume that the current density obeys Ohm’s law, 
and that the charge density is zero, as we should have in the interior 
of a conductor. In that case, assuming (4.3) for J, and proceeding 
in a similar manner, we find that in place of ( 5 . 3 ) we should assume 


— €jU 


d 2 A 

dt 2 


= -mJ> 


d 2 <p p 

— ep —=■ -- 


dt 2 


(5.4) 


div A + <rp<p + ep~ 

dt 


and in place of the equations (5.4) we have 


»2 a 3A d 2 A 

v A - _ 


d(p 


d 2 <p 


T72 

v * ~ Tt ~ (li W 


0 , 

0 . 


(5.5) 


We shall find that these equations, involving first as well as second 
time derivatives, represent damped or attenuated waves, as we should 
expect in an absorbing medium, such as a metallic conductor. 


Problems 

Two parallel conducting strips, of width w, distance of separation d (where 
d<Kw), indefinitely long, carry current in opposite directions, so as to form the 
two conductors of an electric circuit. Find the self-inductance of such a circuit 
per unit length. 
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2. Two concentric thin-walled hollow conducting cylinders, of radii n and r 2 , 
carry current in opposite directions. Find the self-inductance per unit length. 
Find how the result is changed if the inner conductor is a solid conducting rod, 
carrying current uniformly distributed through its interior. 

3. Find the self-inductance of a circuit consisting of a hollow cylindrical con¬ 
ductor of radius r, parallel to an infinite conducting sheet that forms the return 
path of the current. 0 Suggestion: Use the method of images.) 

4 . A magnetic field points along the z axis, and its magnitude is proportional 
to time, and independent of position. Find the vector potential. Assuming that 
the scalar potential is zero, find the induced electric field. Prove by direct integra¬ 
tion, using a circular path, that the law of induction holds. 

5. Describe the magnetic field between the plates of a condenser while it is 

being charged up. ^ 

6. Starting from the induction law, show that the line integral of 

around a closed path is zero, where A is the vector potential. From this show 

that the curl of the above vector vanishes, and hence that E — — grad <p - 

where <p is the scalar potential. 

7. Derive the differential equations satisfied by A and <p for quasistationary 


processes. 

8. Starting with the equation of continuity, and assuming Ohm's law, show 

that the charge density in a conductor obeys the equation - p + ^ =0. Show 

that any existing charge distribution within a conductor will be damped off expo¬ 
nentially, and find the time required for it to be reduced to 1/e of its initial value. 
Insert numerical values for copper, and find the value of this time, the relaxation 
time. Similarly find the relaxation time for a good insulator, such $.s quartz. 

9. A long solenoid of n turns per unit length carries an alternating current of 
angular frequency o> and peak value I. If a nonmagnetic rod of conductivity o~ 
just filling the solenoid is placed inside it, find the current distribution, the magnetic 
field, and the heating loss per unit length inside the copper rod. (The heat loss 
per unit volume per second equals J • E.) Discuss the behavior of your solution 
for the limiting cases of large and small conductivities. How would your results 
be altered if the rod were coaxial with the solenoid but of smaller radius? 

10. The outer conductor of a coaxial line as in Prob. 2 is a thin-walled cylinder 
of radius a and thickness t. Show that the contribution to the inductance per unit 
length of the line arising from the magnetic field in this conductor is given very 


nearly by 


Mo t 

12ir a 


where 



1 . 


11. Show that, in a nondissipative medium if the divergence of the vector 
potential is set equal to zero, instead of satisfying Eq. (5.3), the scalar potential 
satisfies Poisson's equation with p, the density of electric charge, a function of 
coordinates and time. Find the differential equation satisfied by the vector 
potential under these conditions. 



CHAPTER VIII 

ELECTROMAGNETIC WAVES AND ENERGY FLOW 


The first and most conspicuous success of Maxwell's theory was 
its prediction of the existence of electromagnetic waves, whose veloc¬ 
ity of propagation equaled the experimentally known velocity of 
light. It was immediately clear that light must be a form of electro¬ 
magnetic radiation, of short wave length. It was a number of years 
later that Hertz demonstrated experimentally the existence of electro¬ 
magnetic waves of longer wave length, but when they were found, 
they proved, like light, to satisfy Maxwell's equations. In the 
present chapter we shall discuss the simplest forms of waves, plane 
waves; we come to more complicated waves, such as spherical waves, 
in a later chapter. We also consider an aspect of the electromagnetic 
field that we have so far passed over: the energy associated with 
it. We have so far treated problems by considering only the forces 
acting, rather than the work done and the resulting energy. With 
radiation, however, it is so obvious that the field carries energy that 
without a consideration of the energy flow we can hardly form a 
correct picture of the phenomena. We therefore take up, not only the 
density of electric and magnetic energy in the field, which ties in with 
well-known facts regarding the energy in condensers and inductances, 
but also Poynting's theorem, a deduction from Maxwell's equations 
which provides a simple way of finding the flow of energy in any 
electromagnetic field. 

1. Plane Waves and Maxwell’s Equations. —Let us consider the 
problem of solving Maxwell's equations in a uniform material having 
dielectric constant k 6 = e/eo, magnetic permeability K m = ju/Vo, con¬ 
ductivity <r, but not any charge, or any current other than that 
determined by Ohm’s law. Maxwell's equations in this case are 

curl E — —/i —, div H = 0, 
curl H = <rE + € div E = 0. (1.1) 

We take the curl of the first equation and substitute from the second 

90 
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for curl H, obtaining 

curl curl E 


dE 

-<r„ — - e„ 


am 

at 2 ' 


Similarly, we take the curl of the second and substitute from the first, 
obtaining 


curl curl H = 


dH d 2 H 
~ ab¬ 


using the equation of vector analysis, 

curl curl A = grad div A — V 2 A, 

where A is an arbitrary vector function, and using the equations 
div E — 0, div H = 0, these equations become 


V 2 E 

V 2 H 


- afx~ — 


dE 

a 2 E 

L dt 

li 

c* 

1 

dH 

a 2 H 

dt 

~ at 2 = 


- 


0 , 

0 . 


( 1 . 2 ) 


Thus E and H satisfy the same wave equation that we have already 
found in Eq. (5.5), Chap. VII, for the potentials A and <p. We can 
of course get the case of a nonconducting medium by setting <r = 0, 
in which case the middle term of the equation is omitted. The equa¬ 
tions (1.2) are vector equations, which means that each of the six 
components of E and H separately satisfies the same scalar wave 
equation. 

With a wave equation of the form of 

du d 2 it r. 

V 2 u — o-fj. — — — 0, • (1.3) 

where u, a scalar, can stand for one of the components of E, H, A, or 
<p, we can find a great variety of solutions. In fact, a good deal 
of the remainder of our study will be devoted to different solutions 
of this equation. * We shall start with the simplest, and in many 
ways the most important, of these solutions, the plane wave, under¬ 
standing that it is a special case, rather than a general solution. We 
shall assume that it is a function of z, one of the space coordinates, 
and of t, only, being independent of x and y, or constant on planes of 
constant z. Furthermore, we shall assume that u varies with both 
z and t in an exponential manner. We shall write our assumptions 
in the form 

iut—yz 


u — U(,e- 


(1.4) 
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where j — \/—l, and u 0 , «, and y are constants. We shall describe 
the physical significance of these constants in a moment. Substitut¬ 
ing (1.4) in the wave equation (1.3), we find 


y = ± \/crjujco — e/u(o 2 



(1-5) 


In other words, if y is given by either of the values ( 1 . 5 ), the expres¬ 
sion (1.4) is a solution of the wave equation. 

From the form of (1.5) we see that y is in general complex. We 
may write it 

y = o: + j(3, (1.6) 

where a and (3 are real and imaginary parts, respectively. We note 
that the quantity whose square root is being taken in ( 1 . 5 ) lies in the 
second quadrant in the complex plane, its real part being negative and 
its imaginary part positive; thus the positive square root lies in the 
first quadrant, so that a and (3 are both positive. If in particular 
the medium is nonconducting, so that <r is zero, we have y — j(3, 
where (3 is positive. We may now rewrite the solution ( 1 . 4 ) in the 
form 

u = uoe^e^ 031 ^^. (1.7) 

As in the theory of oscillating circuits, for physical purposes we 
use the real part of this complex expression to represent the real 
value of u. Thus (1.7) represents a disturbance whose value at a 
given point of space varies sinusoidally with time, with angular 
frequency o >, or frequency / = oo/2tt. It also varies sinusoidally with 
z, in case a = 0, with wave length X = 2x7/3, so that, when z increases 
by a wave length, the disturbance reverts to its initial value. A given 
wave crest moves with a velocity v, given by 


the velocity being positive, or in the direction of increasing z , for the 
upper sign in (1.7), and negative for the lower sign. The factor 
e =F«* represents a falling off of intensity, or damping, in the direction 
of propagation of the wave, such that the amplitude falls to 1/e of 
its value in a distance 1 /a. We see that the disturbance in a con¬ 
ducting medium is then a damped plane wave, and in a nonconducting 
dielectric it is an undamped plane wave. 

For an undamped wave in a nonconducting medium, the velocity 
of propagation takes a simple form. In that case, setting a = 0, we 



Sec. 1] ELECTROMAGNETIC WAVES AND ENERGY FLOW 


93 


have from (1.5) and (1.6), 

* = " ^ w = i = < 18 > 

If we set 

€ = K e € 0 , At = K m fX o, 

following Eqs. (2.1) of Chap. IV, and (2.4) of Chap. VI, we find 

c 1 

v = c = —7= = 3.00 X 10 8 m/sec, 

n Veono _ 

n = s/ KeK m . (1.9) 

That is to say, the velocity of the electromagnetic waves in empty 
space is 3 X 10 8 m/sec, which is known to be the velocity of light. We 
have already mentioned that it was this fact which originally led 
to the conviction that Maxwell’s equations furnished a theory of light, 
and that Hertz’s discovery of electromagnetic waves has led to a 
series of waves of this type which can exist for frequencies, over 
apparently an unlimited range. We now are familiar with waves 
ranging from ordinary radio waves at frequencies of a few kilocycles 
per second, short-wave radio at a few megacycles per second, micro- 
waves with thousands or tens of thousands of megacycles per second, 
or wave lengths down to the order of a centimeter, through the infra¬ 
red with wave lengths from fractions of a millimeter down to the visible 
spectrum with wave lengths of a few thousand angstroms, through the 
X-ray region with wave lengths down to a few hundredths of an 
angstrom, and from there through the gamma rays down to wave 
lengths whose lower limit is so far not known. All these waves 
satisfy the wave equation, all are essentially the same type, and in our 
subsequent treatment we shall consider them all. 

The index of refraction n, defined in (1.9) as the ratio of the 
velocity of light in free space to the velocity in a material medium, is 
given in terms of the dielectric constant and magnetic permeability. 
Experimentally, this relation holds very accurately for radio waves, 
and in some materials for the infrared region, but it is almost never 
correct in the visible region or for shorter wave lengths. The reason 
will be taken up in a later chapter. It is essentially the fact that the 
dielectric constant is a function of frequency, rather than a true 
constant. The charges inside an atom or molecule, which become 
polarized to lead to the dielectric phenomena, as we have described 
in Chap. IV, have a certain mass and consequent inertia. For this 
reason they do not polarize instantaneously, but lag to some extent 
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behind the field that polarizes them. The treatment of Chap. IV 
holds only for the steady state, or for the static dielectric constant. 
In a later chapter we shall take up the time variation, showing that 
we obtain a correct picture of the variation of the index of refraction 
with frequency, or of the phenomenon of dispersion. 

2. The Relation between E and H in a Plane Wave. —If we substi¬ 
tute expressions of the form (1.4), with constant vectors E 0 and Ho 
rather than the scalar Uq multiplying the exponential, into Maxwell’s 
equations (1.1), we find the following equations from the various 
components of Maxwell’s equations: 


yE y = — ju j<joH x 
—yEx = —piju>H y 
0 = 


-y H z = 0 

yHy == ((T “ 1 “ ijcO^Ex 

— yH x = (<r + ejo>)Ey 
0 = (<r + <ijo})E e 
— yE g = 0. 


We notice in the first place that E z and H z are zero; that is, there is no 
component of E or H in the direction of propagation, or the wave 
is transverse. The remaining equations can be written in the form 


Ex 

Hy 


Ey _ rjr _ M03 y _ , / JJL 

H x y <r + eju ± e - jc/J 


.(2.1) 


in which the various forms of the constant Z 0 are related through (1.5). 
These express the fact that E and H are at right angles to each other, 
as well as being at right angles to the direction of propagation, and 
that the ratio of the magnitude of E to the magnitude of H is the 
constant Z o. We may write this expression in a vector form, if k is 
unit vector along the z axis; we have 


k X E = Z„H, k X H = (2.2) 

£0 

In the case of empty space, we have 

Zo = * Vt° = ± \ls% X Wi’ = ±376 - 6 ohms ' (2 - 3) 

where the units of Z 0 are most easily seen from the fact that it measures 
a ratio of E, in volts per meter, to H, in ampere-turns per meter, and 
therefore must equal volts/amperes or ohms. Because the units of 
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E/H are the same as those of impedance, the value of Z o is often 
referred to as the “wave impedance” of the medium, and the particu¬ 
lar value (2.3) is the wave impedance of empty space, which in that 
particular case is a pure resistance, though we see from (2.1) that in a 
conducting medium there is a reactive as well as a resistive component. 

3. Electric and Magnetic Energy Density. —In our treatment of 
electrostatics and magnetostatics we worked entirely with forces 
rather than introducing the concept of energy. In Sec. 3, Chap. I, 
however, we introduced the electrostatic potential, from which we 
can see at once that the energy of two charges qi and q 2 , at a distance 
7*12 from each other, is 


v — 

Vl * 4tT€ 07*12* 


(3.1) 


The total electrostatic energy of a system of charges is then the sum 
of expressions like (3.1) over all pairs of charges. A sum over pairs 
of terms like Va can also be written as half the double sum of the 
same quantities over the indices i and j separately (omitting of course 
the case i — j ); the double sum includes each pair of indices twice, 
as for instance the terms Vi 2 (for i = 1, j = 2) and V 2 i, which equals 
it (for i = 2, j = 1), and to compensate this we must have a factor 34* 
Thus we have for the total potential energy 

v = w« (32) 

% 39*1 

But the summation over j, in the expression above, is just the poten¬ 
tial <pi at the location of the charge q t . Thus we may rewrite (3.2) 
in the form 

V = 14% qtvt. 

1 

If the charges are continuously distributed, we may replace the summa¬ 
tion over charges by an integration, replacing by p dv, the charge 
in the volume dv. Thus we have 

V = H J Pv dv. (3.3) 

This is the standard expression for the energy of a system of charges, 
in terms of the charge density and the potential. 

The expression (3.3) can be rewritten in terms of the field, rather 
than of the charge and potential. We have, for the electrostatic 
case, 

E = — grad <p, div D = — div (e grad <p) = p, 
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which is the form that Poisson’s equation would take in general where 
* may be a function of position. Using this result, (3*.3) takes the form 

V = —%f<p div (e grad <p) dv. (3.4) 

But by simple vector analysis we have 


div ( e<p grad <p) — <p div (e grad <p ) + «(grad <p ) 1 2 . 
Substituting in (3.4), the energy V becomes 

V — dv + %fe<pE • n da. 

If we integrate over such a large volume that ^ and E are negligible 
around the surface, the surface integral vanishes, leaving us with 


V = dv. 


(3.5) 


This is the expression for the electrostatic energy in terms of the 
field. Its interpretation is that we may imagine the energy to be 
localized throughout the field. 

For instance, suppose we take a condenser of capacity C, and let 
its charge at a given moment be q. Assume that we are charging the 
condenser, and that we wish to know how much work we shall have 
to do on it to charge it. To take a small additional charge dq around 
the circuit, against the difference of potential q/C , will require an 
amount of work ( q/C) dq. Thus the whole work done in setting up a 
charge Q is 



q , 1 Q 2 


But if the condenser consists of two plates of area A, distance of 
separation d , filled with a dielectric of dielectric constant e/eo, the 
value of JD inside the condenser is Q/A , and the value of E is ( Q/e)A. 
Since the capacity C is eA/d, we then have 


1 Q 2 _ 1 (eAE) 2 

2 C 2 eA 


d = i {eE*)Ad. 


But, since Ad is the volume of the region in which the field is different 
from zero, this agrees with the expression (3.5). Thus we see in this 
example how we can interpret (3.5) as meaning that energy is located 
throughout the field, with a density 

In a similar way we can consider the magnetic energy to be local¬ 
ized in space. A complete treatment of the magnetic case is con¬ 
siderably more complicated than in the electrical case, and we shall 
give only a partial discussion; the reader who is interested in a more 
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complete treatment may find one in J. A. Stratton, Electromagnetic 
Theory , Secs. 2.14 to 2.18 (McGraw-Hill Book Company, Inc., New 
York, 1941). In the first place, let us consider a circuit of self¬ 
inductance L, carrying an instantaneous current i. If i increases, the 
emf is — L di/dt, and the work that an external emf must do to 
bring about an increase di of the current is Li(di/dt) di — Li di. Thus 
the total work that must be done to build up a current I in the circuit is 

T = J* Li di = Ho¬ 
using Eqs. (2.1) and (2.2) of Chap. VII, this may be written 

T = KJI ds • A. (3.6) 

In this expression, the integration is around the circuit, I is the 
current flowing in the element ds, and A is the vector potential at 
that point. Using an argument like that above, we can also include 
the mutual effect when many currents are flowing; the only difference 
is that A in (3.6) must be the resultant vector potential not only of the 
one circuit but of all circuits. If the current densities are distributed 
throughout space, the current density being J, we replace an integral 
about a circuit by a volume integral, and have 

T = W .J * A dv. (3.7) 

The analogy of this expression to (3.3) is obvious, and it shows us 
for the first time that the vector potential has much the same relation 
to the energy of currents as the scalar potential has to the energy of 
charges. 

We may now carry out a transformation, as with the electrostatic 
case, so as to write this energy T as an integration of the magnetic 
field rather than of the current density and vector potential. We have 

div (H X A) = A * curl H - H • curl A = J • A — H * B, 

in which we have used Ampere’s law J = curl H in the form that holds 
for the stationary case. Using this result we may rewrite (3.7) in a 
form involving a volume integral of H • B, and a surface integral of the 
normal component of H X A. As before, if we integrate over all 
space, we may assume that the fields arc so small over the infinitely 
distant outer boundary that the surface integral vanishes, and we 
have finally 

T = VzS H • B dv = HjvII 2 dv, (3.8) 

in which the second form holds when B = fxH.. In case this is not so, 
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and we have a ferromagnetic medium, the situation is much more com¬ 
plicated, and we shall not consider that case. The expression (3.8) is 
clearly analogous to (3.5) for the electrostatic energy. In the special 
case of the solenoid we may at once verify the result (3.8), as we did 
in the condenser for the electrostatic case. Let us take a solenoid 
of N turns, length d, area A, in a medium of permeability fx/fxa. If a 
current i flows in it, the magnetic field inside will be (AT/d)/, and the 
induction B will be ( N/d)/xi . The flux of B through the N turns will 
then be NA times B, or ( nN 2 A/d)i , so that the self-inductance will be 
L = txN^A/d. The magnetic energy T is then 

1 T . % XnN*A(Hd\ 2 1 , rr^AJ 

2 ^ =2 S-yw) = 2 

which agrees with (3.8), and fits in with the assumption that the 
density of magnetic energy is 3^/dI 2 . 

The expression (3.5) for the electrostatic energy is convenient 
for solving certain types of problems, because we can find the force 
on a charged body by stating that the work done when it is displaced 
is the change in the potential energy V, assuming the charges to 
remain fixed. For example, we can find the force acting to pull a 
piece of dielectric from one part of a field to another, by solving the 
problem of the field distribution as a function of the location of the 
piece of dielectric, computing the energy as a function of position, 
and seeing how it changes when the dielectric moves. The correspond¬ 
ing magnetic problem is much more involved, however, and is dis¬ 
cussed by Stratton, in the reference given above. 

If we take a circuit carrying a certain current, and displace it in a 
magnetic field, keeping the field and the current constant, the work 
that we must do proves not to be dT , as we should expect at first 
sight, but —dT, On the other hand, in moving the circuit through 
the field, the flux through it has changed, and this has resulted in an 
emf in the circuit. This emf would have resulted in a change of the 
current through the circuit, which is contrary to our hypothesis that 
the current is unchanged. Thus we must have been exerting a 
counter-emf to keep the current constant, and we find that the work 
that our counter-emf has had to do on the current during the displace¬ 
ment is just dT. Thus the work that our countcr-emf does just 
balances the work that we get out of the system by the displacement 
of the circuit, and the net work done is zero. This is consistent with 
an observation that we might have made from Eq. (1.1), Chap. I, in 
which we state that the force exerted on a charge q moving with a 
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velocity v, in a magnetic induction B, is gv X B. This force is at 
right angles to the velocity of the charge, and therefore the work done 
by it is zero. 

On the other hand, we are assuming that the field stays constant 
while the circuit is being moved. Actually this field must be produced 
by other currents in other circuits. As the circuit is moved, the flux 
produced by it at these other circuits changes, and emf’s are induced in 
them. To keep the field constant, and hence the currents in those cir¬ 
cuits constant, we must have been exerting counter-emf’s in those other 
circuits as well, and the work we have done to maintain those 
counter-emf’s proves again to be dT. Thus in the process we have 
a net amount of work dT, made up of —dT as the direct work involved 
in moving the circuit, dT in keeping the current constant in the circuit 
we are moving, and another dT in keeping the current constant in the 
circuits that produce the external magnetic field. Unless careful 
account is taken of these various terms in the energy, whose existence 
we have merely mentioned without proof, there is great danger of 
making a mistake in sign when finding the mechanical forces acting 
on a circuit from the magnetic energy. 

These relations are similar to those found in mechanics in certain 
cases, in which the kinetic energy depends on the coordinates. In 
such a case we find that the force is given by the derivative, not of the 
energy, but of the difference between the potential and kinetic energies, 
with respect to the coordinates. This difference is essentially the 
Lagrangian function. In a similar way here the force is given by the 
derivative, not of the energy V ■+■ T, but of the difference V — T, 
which plays the part of a Lagrangian function, the magnetic energy T 
playing the part of a kinetic energy. This is not unnatural; for 
the electrostatic energy V results from the positions of charges, but the 
magnetic energy T results from their motions, the current i in the 
expression being proportional to the velocities of the charges. 

4. Poynting’s Theorem and Poynting’s Vector.—In the preceding 
sections we have seen that in the static case we have an energy density 
i / 2 ( e /^ 4- jjTI 2 ) of electric and magnetic energy. We shall now show 
that this formula still can be used when the fields are changing with 
time. To show this we shall first prove a mathematical theorem, 
Poynting's theorem, and shall then show its interpretation. By a 
vector identity we have 


div (E X H) = H 


- -E* 


curl E 
dD 


dt 


- E 
H • 


■ curl H 

dB 


dt 


E * J, 
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where in the last expression we have used Maxwell's equations. If 
we assume D = eE, B — juH, we have 

div (E X H) + ^ i («£* + -E • J. (4.1) 

This equation reminds us of an ordinary equation of continuity, 
which states that the divergence of the flux of any quantity, plus the 
rate at which the density of the quantity increases with time, equals 
the ratg at which the quantity is produced. In other words, applying 
the equation to a small volume, it states that the rate at which the 
quantity increases within the volume equals the rate at which it is 
produced within the volume, minus the rate at which it flow's out 
over the surface. The quantity —-E • J represents the rate at which 
energy is produced (that is, E • J represents the rate at which energy is 
lost) per unit volume on account of ordinary joulean or resistance 
heating. Thus the quantity for which (4.1) forms the equation of 
continuity is the energy. We are then justified in interpreting the 
quantity 3^(e£7 2 -f- m# 2 ) as the energy density, extending the results 
of the preceding sections to time varying fields. At the same time, 
we must interpret the vector. 

S = E X H, 

which is known as “Poynting’s vector," as the flux of energy, the 
amount of energy crossing unit area perpendicular to the vector, per 
unit time. 

The conception of the energy of the electromagnetic field as resid¬ 
ing in the medium is a very fundamental one, and has had great 
influence in the development of the theory. Maxwell and his immedi¬ 
ate followers thought of the medium as resembling an elastic solid, 
the electrical energy representing the potential energy of strain of 
the medium, the magnetic energy the kinetic energy of motion. 
Though such a mechanical view is no longer held, still the energy is 
regarded as being localized in space, and as traveling in the manner 
indicated by Poynting's vector. Thus, in a light wave, we shall show 
that there is a certain energy per unit volume, proportional to the 
square of the amplitude (E or H). This energy travels along, and 
Poynting’s vector is the vector that measures the rate of flow, or the 
intensity of the wave. We have already seen that E and H in a plane 
wave are at right angles to each other, and at right angles to the 
direction of flow; thus E X H must be along the direction of flow. In 
more complicated waves as well, Poynting's vector points along the 
direction of the flow of radiation, If, for example, we have ft source of 
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light, and we wish to find at what rate it is emitting energy, we sur¬ 
round it by a closed surface, and integrate the normal component of 
Poynting’s vector over the surface. The whole conception of energy 
being transported in the medium is fundamental to the electromagnetic 
theory of light. 

When we come to charges and currents, however, it is a little 
harder to see the significance of the energy in the medium. For 
example, in a circuit consisting of a battery, and a wire connecting 
the plates, Poynting’s vector indicates that the energy flows out 
of the battery through the space surrounding the wire, and finally 
flows into the wire at the point where it will be transformed into 
heat. This seems to have small physical significance. With a mov¬ 
ing charge, the situation is somewhat more reasonable. A simple 
model of an electron, which was supposed before the quantum theory 
to represent its actual structure, was a sphere of radius R, on the 
surface of which the charge is distributed. Then the field E will be 
€/4 tt€o r 2 at any point outside the sphere, where e is the charge. The 
total electrical energy is the volume integral of (e 2 /32ir 2 €o)(lA 4 ) over 
all space outside the sphere, or 

e 2 f “ 4xr 2 , _ e 2 
327T 2 eo JR r 4 r &TT6 qR 

In the classical theory of the electron, which we have mentioned, it 
is this quantity which is interpreted as being the actual constitutive 
energy of the electron, though a correction must be made of an addi¬ 
tional energy of a nonelectromagnetic nature that is required to keep 
the sphere in equilibrium. Neglecting this correction, we can com¬ 
pute the mass of the electron. For a relation of Einstein’s relativity 
theory says that a given energy has a mass, given by the relation 
energy = me 2 . Hence me 2 = e 2 /8xeo R. Solving for the radius, we 
have R — e 2 /87reomc 2 , a familiar formula for the radius of the electron. 
The more correct formula, inserting the correction we omitted, differs 
only by a small factor. Using the values e — 1.G0 X 10“ 19 coulomb, 
m = 9.1 X 10- 31 kg, c = 3.00 X 10 8 m/sec, <io = 8.85 X 10~ 12 farad/m, 
we have R = 1.42 X 10 -15 m = 1.42 X 10“ 13 cm. Now, if this elec¬ 
tron moves, it will produce a magnetic field, as a current would, and 
hence will have a certain magnetic energy. Since the magnetic field 
is proportional to the velocity (or the current), the magnetic energy 
is proportional to the square of the velocity. This can be shown 
to be the kinetic energy. Further, there will be a Poynting vector, 
pointing in general in the direction of travel of the electron, and 
representing the flow of energy associated with the electron. All 
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these relations prove on closer examination to be more complicated, 
than they seem at first sight, but they are suggestive in pointing one 
possible way to an eventual theory of the structure of the electron., 
which even the present quantum theory is unable to supply completely. 

6. Power Flow and Sinusoidal Time Variation.—We shall often 
want to find Poynting’s vector, and the energy density, in cases where 
the fields E and H are the real parts of complex exponentials such as 
that given in (1.7). At a given point of space, let us assume that; 
E is given by the real part of Eoe 30Ji , and H by the real part of Hoe ?w£ , 
where Eo and Ho are complex vector functions of position. Let the 
real part of Eo be E r , and the imaginary part E,-, with similar notation 
for Ho. Then E is given by 

Re(Eoe 1<ot ) = Re[(E r -f- jEi)(cos cat + j sin coif)] = E r cos coif — Ei sin cat. 
H is given by a similar expression. Poynting’s vector is then 

E X H = (Er X Hr) cos 2 cat + (Ei X Hi) sin 2 cat 

— [(E r X Hi) + (E{ X Hr)] sin cot cos cat. 

We notice that there are two types of terms: the first two, whose 
time average is different from zero, since cos 2 cat and sin 2 cat average 
to and the last term, whose time average is zero, since sin cat cos cat 
averages to zero. Thus the time average Poynting vector is 

Average (E X H) = }4(E r X H P + E» X Hi). (5.1) 

This can be rewritten in a convenient way, by using the notation of 
complex conjugates, where the complex conjugate of a complex number* 
is the number obtained from the original one by changing the sign of j 
wherever it appears, and is indicated by a bar over the number. In. 
terms of this notation, let us consider the quantity (E X H). This is 

(E X H) - (Eoe 3 ' wt ) _X (S 0 <r*“) 

= (Eo X Ho) = (E r X jEi) X (Hr - jHi) 

= (Er X Hr + Ei x Hi) + j(Ei x Hr - Er X Hi). (5.2) 

We see that, except for the factor K, the real part of (5.2) is just the 
same as the quantity appearing in (5.1). That is, we have 

Average (E X H) = }^Re {E X H), (5.3) 

where the E and H appearing on the right side of the equation are the 
complex quantities whose real parts give the real E and H appearing 
on the left of the equation. A similar derivation for the energy 
density shows that 

Average }*ieE z = 3^«E * E, 
with a similar formula for the magnetic energy. 


(5.4) 
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6. Power Flow and Energy Density in a Plane Wave.—In (2.2) we 
have found that E and H in a plane wave are at right angles to each 
other, and at right angles to the direction of propagation. In par¬ 
ticular, we have for a nonabsorbing medium, where Z Q is real, 

X S) = ( E -1) = ikZ„(H • S). (6.1) 

That is, the flow of energy is along the direction of propagation, and 
proportional to the square of the amplitude of E or of H. Similarly 
the average electrical energy density is 3^e(E • E), and the average 
magnetic density is 3^ju(H • H). Because of the relation (2.1) between 
the amplitudes of E and H, we find easily that the average magnetic 
energy equals the average electrical energy, so that the total energy 
density is H«(E • E). We note, from (6.1) and (2.1), together with 
(1.8), that we have the relation 

Energy flux = v X energy density. (6.2) 

This equation has a simple meaning. If the energy were flowing with 
velocity v, in the direction of propagation of the wave, all the energy 
contained in a cylinder of unit cross section, and height equal to v, 
would cross unit cross section per second, forming the flux. 

In a conducting medium, in which a is different from zero, E and H 
will each contain a factor c~ az , so that Poynting's vector and the 
energy density will each have a factor e~ 2a3 , showing that the intensity 
is damped or attenuated by this factor in traveling along. Poynting's 
theorem of course tells us that the energy lost to the wave as it advances 
is used up in resistive heating of the medium. We readily find that, 
in a conducting medium, the electrical energy is no longer equal to the 
magnetic energy, but as the conductivity becomes greater and greater, 
the electric energy becomes smaller and smaller compared with the 
magnetic energy, finally vanishing in the limit of infinite conductivity. 
We shall investigate these relations more completely in the next 
chapter. 

Problems 

1. If the generation of heat per unit volume in a conductor carrying a current 
is <xE 2 , prove that, for a cylindrical conductor of resistance R, carrying a current i, 
the rate of generation is i 2 R. 

2. Given a cylindrical wire carrying a current. Find the values of E and H on 
the surface of the wire, computing Poynting’s vector, and show that it represents a 
flow of energy into the wire. Show that the amount flowing into a given length of 
wire is just enough to supply the energy that appears as heat in the length. Note 
that the surface of a wire carrying current is not an equipotential, so that there 
can be a component of electric field parallel to it. 
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3. Calculate the electrical and magnetic energies in a plane wave traveling in a 
conductor, and show by direct comparison that they are different from each other. 
What happens in the limiting cases c —► 0 and cr —> <», that is, insulators and perfect 
conductors? 

4. In the absence of charges, solutions of Maxwell’s equations may be obtained 
from a vector potential A alone, setting the scalar potential equal to zero. Work 
this out for the general case of plane waves propagated along the z axis, showing 
that the vector potential can have two arbitrary complex amplitudes for its x and y 
components, the z dependence being in the factor e* at ~ y *. From this find the 
relations between E and H in a plane wave, showing that your results are identical 
with those obtained in the text. 

5. A coaxial line of length l, inner and outer radii a and b, carries a steady 
current I. The resistance of the inner conductor is Ri, that of the outer conductor 
R 0 , and the load resistance is R. Find expressions for the components of E and H 
at any point between the conductors. Show that the component of E parallel to 
the axis reverses direction as one moves from the inner to the outer conductor, 
and that it is zero at a radius ro given by In ( r 0 /a ) = [Ri/(Ro + Ri)] In ( b/a ). 
Compute the Poynting vector, and discuss the power flow in this field. Integrate 
the radial component of the Poynting vector over the surface of both inner and 
outer conductor, and show that these integrals yield the I 2 R losses in each. Apply 
the Poynting theorem to a volume bounded by the planes z\ and and the cylin¬ 
drical surfaces ri and r 2 . 

6. Consider a plane wave in empty space given by 

B, - VXwM H v = 

with /3 = « V^ojwo- Show that an approximate solution to Maxwell’s equations 
can be obtained to represent a plane wave of finite cross section by considering E 0 
to be a slowly varying function of x and y and adding to the above field components 
the longitudinal components 

E z - —4 and H t -| ^ «*<»*•*■>. 

/3 dx fi 'no dy 

By slowly varying is meant that the percentage change in the fields is small com¬ 
pared with unity if one moves a distance of one wave length in a direction trans¬ 
verse to the direction of propagation, so that second derivatives and products of 
first derivatives with respect to x and y may be neglected. 

7. Prove that, if a piece of steel is magnetized by an alternating current, the 
energy loss per cycle due to hysteresis is given by • dB per unit volume. 

8 . A magnetic circuit of uniform cross section consists of two sections separated 
by air gaps of length so small that fringing may be neglected. Consider the process 
of increasing the air-gap lengths each by an infinitesimal amount, maintaining the 
magnetizing current constant. Compute the change in magnetic field energy and 
the work done by the emf that maintains the magnetizing current constant during 
the displacement. From this obtain the mechanical work done in effecting the 
displacement, and show that the force with which the two sections attract each 
other is given by B*A/y.o, where A is the cross section of the magnetic circuit. 
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In Chap. VIII we investigated the propagation of plane waves 
in a medium with a given dielectric constant and conductivity. We 
have seen that in a nonconducting nonmagnetic medium a wave is 
propagated with a velocity c/n, where c is the velocity of light in 
empty space, n the index of refraction, which is equal to y/ k 6 . In a 
conducting medium, there is a damped wave, whose propagation 
constant, determining both the velocity of propagation and the rate 
of absorption, is given by Eq. (1.5), Chap. VIII, predicting a definite 
variation of propagation properties with frequency. These two formu¬ 
las give a very straightforward way of testing the electromagnetic 
theory of light: we can measure the dielectric constant and conductiv¬ 
ity of a material, and see if its optical index of refraction and absorption 
coefficient are properly related to these constants. One of the first 
observations after the formulation of Maxwell's theory was that these 
relations are not fulfilled by real substances. The departures between 
observation and the simple theory have been very useful in gaining a 
knowledge of the structure of actual dielectrics and conductors. 
Briefly, the discrepancy between theory and experiment is explained 
by supposing that the dielectric constant and conductivity are func¬ 
tions of frequency; a large part of the electron theory of solids is 
devoted to an explanation of the nature of this frequency variation. 

The experimental situation is much clearer than it was in Max-- 
well's day, because of the wider ranges of the spectrum that have 
been explored in the meantime. We shall first describe the situation 
for dielectrics without conductivity. For long radio wave lengths, 
waves can be propagated with a velocity given by the index of refrac¬ 
tion determined from the static dielectric constant. As the frequency 
increases, however, the index of refraction starts to increase, until 
somewhere in the spectrum it goes through a maximum, then drops 
suddenly, and begins to increase again. The phenomenon of change 
of index of refraction with frequency is called “ dispersion," and the 
sudden drop we have just mentioned is called “anomalous dispersion.” 
In the neighborhood of a region of anomalous dispersion, there is 
absorption, even when the material does not absorb elsewhere; the 
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index of refraction and absorption coefficient in this neighborhood 
behave much like the reactance and resistance of a resonant electric 
circuit. Most materials have several regions of anomalous dispersion 
and absorption, in different parts of the spectrum. Many materials, 
for instance, have absorption in the infrared, and practically all 
materials absorb in the ultraviolet. Materials that absorb in the 
visible part of the spectrum are those which appear colored. By the 
time we go through the ultraviolet to the X-ray region, anomalous 
dispersion has stopped, and the index of refraction of all materials has 
become almost exactly unity; it is for this reason that lenses and 
prisms are practically impossible in the X-ray region. The absorption 
has also largely decreased in the X-ray region, and it is for that reason 
that X rays can penetrate so many types of materials. Passing 
beyond, to the region of gamma rays, all materials become rather 
transparent and nonrefracting. 

For condu ctors, we fo und in Eq. (1.6), Chap. YIII, that y is 
given by jco -%/ (e — jV/co)^. We see that as the frequency approaches 
zero the second term in the radical becomes infinitely greater in magni¬ 
tude than the first. Thus at low frequencies the optical properties of a 
conductor are determined entirely by the conductivity, and not by 
the dielectric constant; in fact, there is no experimental way of finding 
the dielectric constant of a good conductor at low frequencies. The 
values of <r for good conductors, such as metals, are such that <r/e 
does not become comparable with co until we reach frequencies in the 
visible part of the spectrum. It is found experimentally that the 
optical properties of metals are well described by this simple theory 
through the radio-frequency, microwave, and infrared parts of the 
spectrum, and that, as we should expect, the dielectric effects begin 
to be important in the visible region. The situation there, however, 
is much more complicated than would be indicated by Eq. (1.5), 
Chap. VIII; for by then, there is good evidence that both e and a vary 
in complicated ways with the frequency. The variation of e for a 
metal is not unlike that for a dielectric; there is anomalous dispersion, 
and consequent absorption, superposed on the conductivity. As wo 
go through the ultraviolet to the X-ray region, the effect of conductiv¬ 
ity becomes negligible, and metals are no more absorbing than other 
materials. 

We shall now examine the way in which simple electronic models 
of dielectrics and conductors can lead to variations of dielectric con¬ 
stant and conductivity with frequency which are in at least qualitative 
agreement with the observations. 
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1. Dispersion in Gases. —In Chap. IY, we described the nature 
of dielectric polarization, stating that it can arise in two ways: from 
the polarization of the atoms or molecules, by the displacement of the 
electrons in them; and by orientation of dipoles already existing, under 
the action of the external field. We shall discuss only the first type 
of polarization in the present chapter. We shall find in a later section 
that, in a solid or liquid, the problem of polarization is complicated 
by the interactions between the molecules or atoms; but in a gas, the 
molecules are far enough apart so that we can neglect the interactions 
between them. Each molecule contains charges that can be displaced 
under the action of an external field, and these charges act as if they 
were held to positions of equilibrium by restoring forces proportional 
to the displacement. Thus in a static case an electron of charge e is 
acted on by the force eE of the external electric field, and — ax, a 
linear restoring force, proportional to the displacement x, with a 
constant of proportionality a. The displacement is then x — ( e/a)E , 
and the induced dipole moment ex = (e 2 /a)E. Thus the polarizability 
a of a molecule is e 2 /a, and using Eq. (2.4) of Chap. IY, the dielectric 
constant is given by 

- 1 + JV—. (1.1) 

tt€o 

where N is the number per unit volume. 

The value (1.1) is the static value. If the external field varies 
with time, we must take account of the fact that the electron has a 
mass m, thus possessing inertia. We shall also introduce, in addition 
to the external electric force and the elastic restoring force, a damping 
force proportional to the velocity, to account for the absorption. The 
equation of motion for the electron is then 

m mg ^ mo3 ° x = eE} C 1 * 2 ) 

where we have rewritten the linear restoring force in terms of a con¬ 
stant mcog. If E varies sinusoidally with time, as the real part of an 
expression containing the exponential factor e iwt , we can then solve 
(1.2), as always in determining the forced oscillations of a linear 
oscillator, by assuming that x varies also as e la3t . We then find 

^ = (e/m)E 

cog — w 2 jwgr 

That is, the electron vibrates with the same frequency as the external 
field, but with an amplitude depending on the frequency. If we have 
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Nh electrons per unit volume characterized by constants <*>* and g k , the 
polarization is 

p = sV N * e */™ 

L-j 03 % — (a 2 + J&g k 

k 

and the dielectric constant is 

«.=i+ y , 7T - " < i - 3 > 

Zj 03% — C*) 2 + J03gk 
k 

In the limit of low frequencies, where <o can be neglected, this reduces 
ess e nt i ally to the value (1-1), the static dielectric constant, but at each 
of the resonant frequencies 03 k there is a phenomenon such as is found 
with the impedance of an electric circuit, near its resonant frequency. 
The magnitude of the dielectric constant varies rapidly with frequency, 
and at the same time it becomes complex, leading to absorption, as well 
as dispersion. 

It is convenient, in an absorbing medium, to introduce an absorp¬ 
tion coefficient k, as well as an index of refraction n, by the relation 


7 = 3 ~An-jk), 

C 

Then we have 


(n 


g— y* — g— ( u/c) kzg-j ( 03/c) ^ 


jk ) 2 = K e . 


For a gas, there are few enough molecules so that the second term of 
(1.3) is small compared with unity. Thus we have approximately 


n — jk = 1 + 


1 V ___N_kfi/rru o 

2 /Lj 03% — 03 


2 + juQk 


and, if we separate into real and imaginary parts, we obtain 

„ _ , . I V (^-e 2 /m€o)(«j - co 2 ) 

^ 2 4 (cot - <o 2 ) 2 4- 

k 

7 1 V (N k e 2 /m6Q)03g k 

2 L( (03% - 03*)* + <0*1 gi 
k 

If we consider these two quantities as functions of frequency, they 
show the properties we have already discussed. As the frequency 
increases, the index of refraction goes through a phenomenon of 
anomalous dispersion in the neighborhood of each of the resonance 
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frequencies eo*, the index eventually approaching unity, and being 
in fact slightly less than unity, for very large values of frequency; 
while the absorption coefficient is small everywhere except near each 
of the resonance frequencies. The behavior of n and k near resonance 
is as shown in Fig. 24. 

2. Dispersion in Liquids and Solids. —In the case of solids and 
liquids we may no longer make the approximation that the force 
acting on an electron is simply the 
electric vector of the electromag¬ 
netic wave in free space, but must 
take into account the added force on 
the electron due to the polarization 
of the body. We can calculate this 
force as follows: we imagine a 
small sphere of radius R (with its 
center at the position of the electron 
in question) cut out of the medium. 

If we do this, without disturb¬ 
ing the distribution of polariza¬ 
tion outside the sphere, we have 
induced charges on the surface 
of this spherical volume from which we calculate the force at the 
center of the sphere. The surface density of induced charge on a 
spherical ring at an angle 9 to the direction of the field is P cos 0 . 
Since the area of a ring included between angles 6 and $ •+■ dd is 
2-irR 2 sin 9 d$, the charge on the ring is %rPR 2 cos 9 sin 9 d9. This 
charge produces a field at the center of the sphere whose component 
parallel to E is 



Fig. 24.—Anomalous dispersion, 
showing index of refraction and absorp¬ 
tion coefficient as function of frequency. 


dEi 


2ttPR 2 cos 2 6 sin 9 d9 
A.TC€qR 2 


so that the total charge on the spherical surface produces a field at the 
center equal to 

P f* P 

Ei = =— I cos 2 9 sin 9 dd = 5 — 

*€o Jo oeo 


The total electric field at the center of this sphere is then 

P 


E + 


3eo 


Of course, there is still the contribution to the force by the atoms 
inside the little sphere we have cut out, but in an isotropic medium 
this averages to zero. 
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We can now carry over our calculations for gases if we replace E by 
(.E + P/3eo) in the expression for x. Thus we get 


P = 



Nj^Jrn , 
co| — co 2 + jo>gic 


and using the relations D — 




and we find for k 6 


E + f- 

oe o 


K e 


+ 2 


E, 


Ke - 1 (n - jky - 1 = 1 V Nj^/me 0 

k c + 2 (n — jk ) 2 + 2 3 ^1/ co| — o> 2 + .7«<7 a 

k 

If N represents the number of atoms per unit volume, and if /& is the 
fraction of atoms of the /rth type, so that N k = f k N, we may rewrite it 


(n — jk) 2 — 1 _1_ _ 1 fke\/mep 

(n — jk ) 2 + 2 iV 3 cof — o> 2 -|- Jco<7a: 


In this formula, the quantities f k , which are independent of the num¬ 
ber of atoms per unit volume, are often called the "oscillator strengths ” 
of the various resonances. 

For a transparent substance, where we can neglect the damping 
force, and the index of refraction is real, we have for a given frequency 
of light 


n J -ll , , 

— 5 —;—s — = constant, 
n 2 + 2 po 


where po is the density of the body, obviously proportional to N. 
This law, known as the Lorenz-Lorentz law, is surprisingly well obeyed 
for many substances. In many cases it even gives approximately 
correctly the relationship between the index of refraction of a liquid 
and of its vapor. In the limit of very long electromagnetic waves, 
and for the electrostatic case, we have 


--—- — = constant, 

K e “P 2 p 0 

the so-called "Clausius-Mosotti relation” between dielectric constant 
and density. 

There is a different way of handling dispersion in liquids and 
solids, equivalent to this, but exhibiting the result, in different form. 
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We shall consider only the case where there is but one type of oscillat¬ 
ing electron. In (1.2) we replace the right side by e(E 4- P/3e 0 ), 
but also we write P — Nex, which would be the relation for only one 
type of electron, and incorporate the term in a; in the left side of the 
equation, so that the equation becomes 


( 2 . 2 ) 

(2.3) 

That is, we have the same type of anomalous dispersion that we 
found in (1.3) for a gas, but with the resonant frequency displaced 
according to (2.3). If there are various resonant frequencies, it is 
no longer a simple matter to prove that a solution like (1.3) can be 
set up, but the proof can be given, by methods similar to those used 
in discussing the normal coordinates of a coupled vibrating system 
in classical mechanics. We note that, since N is proportional to the 
density, the resonant frequencies appearing in a formula of the type 
of (2.2) vary with the density, whereas the frequencies appearing in the 
Lorenz-Lorentz formula (2.1) are independent of density. 

3. Dispersion in Metals.—The simple electron theory of metals 
assumes that free electrons wander about among fixed ions, and 
carry the current. On the average there is no resultant force acting 
on the electrons, except the external field, so that there is a force eE 
acting on them. The equation of motion is thus similar to (1.2), 
except that there is no restoring force, so that coo is zero. We must 
, assume a damping force, proportional to the velocity, and we find 
that it is simply related to the conductivity, according to Ohm’s law. 
For in the case of a constant external field, the electrons will acquire 
constant velocity, which by (1.2) will be eE/mg, so that, if N is the 
number of conducting electrons per unit volume, the current density is 


d 2 x . 

m w + ma 


dx , ( . iVe 2 \ 

Tt + m ~ SSS) * = eE - 


Solving as in (1.3), we then find 


(n — jk) 2 — 1 + ~ 


3eo m,j 
Ne 2 /m 0 


where 


cog — CO 2 + jeag 


cog = cog — 


Ne 2 
, ■ —— . . . 

3 


Nc, 2 E 

J = Nev = ----- = <rE, 
mg 


Ne 2 ' 

mg 


(3.1) 


This holds only for a steady field, however. If E varies as e? ut , then 
we find from (1.2) that 

Ne 2 

<T — -;- —f 

mg -+- mg co 
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so that the conductivity varies with frequency, vanishing with infinite 
frequency. In considering propagation, we have seen in Eq. (1.5), 
Chap. VIII, that the combination k c — jcr/oieo is to be used in place 
of the dielectric constant in a transparent medium. Using (1.3) for 
k c , this gives 


3<r _ 
mm " • - 

C0€ o 


(n — jk ) 2 = 1-4- 


NeP/me 0 
— co 2 -f- jo>g 


+ 


_ N 

u>\ — 
k 


N / me o 


O) 2 + jcogie 


(3.2) 


in which it is clear that the term coming from the conductivity has 
just the same form as the other terms, except for having its resonant 
frequency equal to zero. Separating real and imaginary parts of 
(3.2), and using (3.1), where we shall now use or for its value at zero 
frequency, we may rewrite (3.2) in the form 

1 <r 1 . V (Vjfce 2 /^eo)(co| — co 2 ) 

geo 1 + c c 2 /g 2 + 4 Ml “ <*> 2 ) + ^g\ 

k 

jr_ _1_, V (jVfcgVmfo) jcogk) 

caeo 1 4- o)*/g 2 (ul — <o 2 ) 2 + oo 2 gf| 

k 

We notice that as the frequency becomes low compared with 
<r/e 0 , which for good conductors is in the ultraviolet part of the spec¬ 
trum, the first term in the product nk becomes large compared with 
unity, masking the effect of the bound electrons. The difference 
n 2 — 7c 2 does not become correspondingly large, so that in the limit of 
low frequencies, n becomes equal to k, and both approach \/V/2aik 0 , 
neglecting co compared with g . However, it is only at low frequencies 
that these simplifications occur. As the frequency enters the near 
infrared or visible region, it becomes of the same order of magnitude 
as both o-/eo and g, so that the contributions of the free electrons 
become complicated, and at the same time nk decreases so that the 
contributions of the bound electrons become important. It is thus 
natural that experimentally the curves of n 2 — 7c 2 and nk throughout 
the visible part of the spectrum are very complicated, though they can 
be fitted fairly accurately with formulas of the type we have derived, 
assuming bound as well as free electrons. In the ultraviolet, the 
contributions of the free electrons become small compared with those 
of the bound electrons having resonance in that region, and a metal 
does not behave essentially differently from an insulator. 

4. The Quantum Theory and Dispersion. —The picture of dis¬ 
persion and of the optical properties of dielectrics and metals that 


n 2 — k 2 — 

2nk = 
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we have presented is based on simple classical models of the behavior 
of electrons in these materials. We have assumed that dielectrics 
contain electrons held to positions of equilibrium by linear restoring 
forces, subject to resistances proportional to their velocity, and we 
have assumed the conduction electrons in a metal to be similar, except 
that they have no restoring forces, but only resistance. The theory 
in this form was developed by Drude and Lorentz, in the early days 
of the electron theory. It has proved in practice to be so good as to 
surprise one, when the crudity of the assumptions is considered. The 
real electrons in atoms, molecules, and metals, as we now know from 
the quantum theory, behave very differently from our simple picture, 
but it is a remarkable fact that the quantum theory leads essentially 
to a justification of our mathematical formulation, though not of the 
simple hypotheses underlying it. According to the quantum theory, 
as is well known, atoms, molecules, and other systems have certain 
energy levels in which they can exist, and the emission and absorption 
of radiation are associated with transitions between energy levels, 
the frequency v associated with two levels E\ and E^ being given by 
Planck’s relation E<z — E\ — hv, where h is Planck’s constant. Atoms 
and molecules have only a discrete set of excited energy levels, so that 
from the ground state they can absorb a discrete set of definite fre¬ 
quencies. Metals, on the other hand, have a continuum of excited 
levels, associated with the free electrons. 

In the quantum theory, we can investigate the behavior of an 
atom, molecule, or metal, under the action of an external radiation 
field. We find, by application of certain perturbation methods, that 
the effect of the atom on the field can be replaced by an equivalent 
set of linear oscillators, associated with the various transitions that 
are allowed by Planck’s relation. Each oscillator has a resonant 
frequency given by one of the allowed transitions. With N atoms 
per unit volume, however, we do not have the equivalent of N oscillat¬ 
ing electrons per unit volume of each of the frequencies; we have 
instead only Nf, where / is a fraction, sometimes called the il oscillator 
strength,” and which can be calculated by quantum mechanics. We 
further find that the oscillators have not only resonant frequencies, 
but also damping terms, which can be correlated with the broadening 
of the upper and lower states, by collisions with other atoms and 
other perturbing processes. Thus the net result of the quantum 
discussion # is a theory mathematically equivalent to the one developed 
in the present chapter, but with quite a different physical interpreta¬ 
tion. It is for this reason that a purely classical discussion of dis- 
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persion, such as we have given, is of real physical importance, and not 
merely an academic matter* 

In metals, the quantum theory also leads to quite a different picture 
from an elementary classical theory. Electrons are not governed by 
classical statistics, but by Fermi statistics, as a result of which the 
electrons in a metal are not at rest in the absence of a field, as our 
classical equation of motion would suggest, but actually are in continu¬ 
ous motion with a very high velocity. In the absence of an external 
field, however, as many are traveling in one direction as in the opposite 
direction, so that there is no net current. In the presence of a field, 
the electrons gain an average acceleration that is essentially what 
corresponding classical electrons would experience, though in most 
cases they have an effective mass that is different from their classical 
mass, because of the structure of the energy bands of the metal. The 
electrons, once they are accelerated, are subject, in quantum theory 
as in our classical picture, to a frictional resistive force, but we can 
give a physical explanation of the friction, instead of merely postulating 
it. The electrons have many of the properties of waves, as is observed 
experimentally in electron diffraction, and these waves are scattered 
by the irregularities in the metal produced by the thermal oscillations 
of the atoms. It is this scattering which produces the effect of fric¬ 
tion, dissipating any average momentum that the electrons may 
acquire. Thus the quantum theory eventually arrives at a picture 
of the electrons in a metal that is not unlike that of elementary classical 
theory, and the quantum theory of the interaction of metals with a 
radiation field essentially verifies the simple theory that we have 
described in this chapter. 

This brief discussion of the relation of the classical theory of dis¬ 
persion to the quantum theory is not expected to give a clear idea 
of the quantum phenomena to one not already familiar with them, 
but is intended merely to make it plain that, as we have already 
mentioned, the classical theory of dispersion is of great importance 
physically, furnishing a description of experimental facts that in its 
broad outlines is correct, and correlates with the quantum theory. 


Problems 


1. Show that, in the case of normal dispersion for the visible spectrum where 
there is an absorption band in the ultraviolet, the index of refraction can be written 

diS 


, . , B , C , 

n‘=A +T - 1+Tt + 


% 


where X is the wave length in vacuum and A, B, C are constants. 
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If there is also absorption in the infrared, show that the index of refraction is 
then given by 

tl 2 = A r-r -}- t - ; -h • ■■ — A/X 2 — B'\* • • • . 

A® X 4 

2. Measurements of H 2 gas give the following values of the index of refraction: 


X in A 

{n - 1)10 7 

5,462.260 

1,396 

4,078.991 

1,426 

3,342.438 

1,461 

2,894.452 

1,499 

2,535.560 

1,547 

2,302.870 

1,594 

1,935.846 

1,718 

1,854.637 

1,760 


Using the expression in Prob. 1 for n 2 in reciprocal powers of X, calculate the best 
values of A, B, and C. If the measurements are made at room temperature and 
atmospheric pressure, calculate the resonant frequency w 0 and wave length from 
these constants. 

3. Prove that in the case of anomalous dispersion for gases the maximum and 
minimum values of n occur at the positions where the absorption coefficient reaches 
half its maximum value. Find the relation between Qk and the half width of the 
absorption band. Assume gk/u>k<£ 1. 

4. For the D line of sodium the following values of the constants in the disper¬ 
sion formula are found: 

Ne 2 

o> 0 = 3 X lO^ 6 ; g = 2 X 10 10 ; — = 10 23 . 

Plot the index of refraction n and the absorption coefficient k as a function of the 
frequency of the light. Find the maximum and minimum values of the index of 
refraction n. Find the maximum value of the absorption coefficient k and the 
half width of the absorption band in angstrom units. 

6. Show that for gases the Liorcnz-Lorentz law takes the approximate form 

?. n ~ 1 = constant. The following measurements have been made on air (po 
3 pu 

given in arbitrary units) • 

P n n 

1.00 1.0002929 

14.84 1.004338 

42.13 1.01241 

69.24 1.02044 

96.16 1.02842 

123.04 1.03633 

149.53 1.04421 

176.27 1.05213 

Calculate - - 1 and “ for each of these measurements, and compare the 

' ‘ ' 3 po W 2 + 2 pn 

constancy of the results (calculate to four significant figures). 
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6. The indices of refraction for the sodium D line, and densities in grams ner 
cubic centimeter of some liquids at 15°C are 


Liquid 

Po 

n 

Water. 

0.9991 

1.3337 

1.6320 

1.3558 

Carbon bisulphide. 

1.2709 

Ethyl ether. 

0.7200 



Calculate the indices of refraction for the vapors at 0°C and 760 mm pressure 
The observed values for the vapors are 1.000250,1.00148, and 1.00152, respectively! 

7. The quantity 0^- ^ is called the “refractivity” of a substance if rrt 

denotes its mass. Prove that the refractivities of mixtures of substances equal 
the sum of the refractivities of the constituents. (Neglect damping forces from 
the start.) 

8 . Show that the molecular refractivity of a compound, defined as — M, 

^2 2 pQ 

where M is the molecular weight, is equal to the sum of the atomic refractivities of 
the atoms of which the compound is formed. (Neglect damping forces.) 

9 . Forthe following gases we have the following values of (n — 1) „ extrapolated 
to long wave lengths: 


Gas 

H,. 

N,. 

0 2 . 


( n “ 1)«' io 6 

.. 136.35 
.. 294.5 
.. 265.3 


Calculate the values of (n - 1)„ for the following gases: H a O, NH 3 , NO, N 2 0<, 0». 
The measured values are 245.6, 364.6,288.2, 496.5,483.6, all times 10 s . Find the 
percentage discrepancy between the calculated and observed values. 









CHAPTER X 


REFLECTION AND REFRACTION 
OF ELECTROMAGNETIC WAVES 

In Chap. VIII we investigated the behavior of a plane wave in a 
homogeneous medium, and showed that it is propagated with a veloc¬ 
ity equal to the velocity in free space, divided by the index of refrac¬ 
tion. Next in Chap. IX we investigated the physical nature of 
various types of media, and found the features leading to various 
indices of refraction, their variation with wave length, and to the 
absorption that often accompanies dispersion. Now we shall consider 
what happens at an infinite plane boundary between two semi-infinite 
media of different indices of refraction, such for instance as free space 
and a dielectric, or free space and a metallic conductor. We shall 
be led to the familiar laws of reflection and refraction, laws that 
were established, before the electromagnetic theory, merely from 
general wave theory. To consider the behavior at the boundary, we 
must find the boundary conditions holding at a surface of discontinuity 
between two media. 

1. Boundary Conditions at a Surface of Discontinuity. —In each of 
our two media, we assume that the solution of Maxwell’s equations 
that we desire is a plane wave, just as in an infinite medium. At the 
boundary between the media, however, certain boundary conditions 
are to be met, and these demand that there be definite relations 
between the waves in the two media. These conditions have been 
derived earlier, in Eq. (3.3) of Chap. IV for the electric vectors, and 
in Eq. (3.2) of Chap. VI for the magnetic ones. We rewrite them: 

Normal components of D and B are continuous, 

Tangential components of E and H are continuous, (1.1) 

at a surface that contains no charge and current. These conditions 
will prove to be enough to derive the complete laws of reflection and 
refraction; in fact, the conditions on D and B follow from the others, 
and are not necessary for the derivation. We shall find that in general 
we cannot satisfy them without postulating three separate plane 
waves: an incident and a reflected wave in one medium, a refracted 
wave in the other medium. By arguments like those of elementary 

117 



118 


ELECTROMAGNETISM 


[Chap. X 


optics, involving just the matching up of wave fronts on both sides 
of the surface of separation, we can prove the simple laws of reflection 
and refraction: that the angles of incidence and reflection are equal, and 
Snell’s law of refraction. We can go further than this, however, and 
compute the intensities of the reflected and refracted waves, and 
hence the reflection coefficients, embodied in Fresnel’s equations. 

We shall first prove the laws of reflection and refraction, and then 
Fresnel’s equations. Then yve shall take up two more complicated 
cases, which cannot be handled by the most elementary methods: 
total internal reflection, in which there is an exponentially damped 
wave of an interesting type in the rarer medium, which does not lead 
to a flow of power into the medium; and reflection and refraction by a 
conducting medium such as a metal, in which Snell’s law of refraction 
does not hold in its simple form. In all this discussion, we notice that 
our treatment holds for any part of the spectrum with equal validity, 
from the longest electromagnetic or radio waves, through the visible 
region, to the X rays and gamma rays. The only difference is that the 
index of refraction and absorption coefficient vary with wave length, 
in the manner we took up in the preceding chapter. These quantities 
will appear as constants in our present discussion, however, which will 
deal throughout with monochromatic waves, of a definite wave length. 

2. The Laws of Reflection and Refraction. —Let us assume that the 
surface of separation is z — 0, and that the medium with negative z 
has constants n, «, and that for positive values has //, e', with cor¬ 
responding indices of refraction and absorption coefficients n, k, and 
n r f k'. Let the incident wave be in the medium with negative z, and 
let its wave normal have direction cosines l, m, n. Then, in a non¬ 
absorbing medium, which alone we consider in the present section, the 
exponential factor representing the wave propagation is 

—(Zx-f-my+nz) /v)\ ^2 

where v — c/n is the velocity of propagation. We shall simplify by 
assuming that the wave normal is in the xz plane, and that the angle 
of incidence, or angle between the wave normal and the z axis, is i ; 
then we have l sin z, m = 0, n — cos i } so that (2.1) may be rewritten 


gjo>[t— (x sin i4*z cos i)/v ] ^2 2 J 

Similarly in the medium with, positive z 7 there will be a wave propa- 
gated at an angle r with the normal, r being the angle of refraction; its 
corresponding exponential factor is ? 


—(x sin rArz cos r)/u'] 


( 2 . 3 ) 
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Along the surface of separation, z = 0, the two exponentials must 
agree; for otherwise, even if we were able to satisfy boundary condi¬ 
tions at one point, these would not hold at other points of the surface. 
Thus we must have 


sin i _ v _ n' 
sin r ~ v~' ~ ~n’ 


(2.4) 


which is Snell's law of refraction. Our derivation is simply the 
analytical statement of the elementary fact that the wave fronts in 
the two media must match each other. In addition to the refracted 
wave, we can set up in the first medium a reflected wave, whose z 
component of propagation is reversed, with the exponential 


g7«[*—(a* sin v— z cos i)/v] 


(2.5) 


Along the plane z = 0, this exponential agrees with the other two, so 
that the boundary conditions can be satisfied; and a little consideration 
shows that it satisfies the law of reflection, that the angle of reflection 
equals the angle of incidence i. 

3. Reflection Coefficient at Normal Incidence.—We shall next set 
up the values of the electric and magnetic vectors on both sides of 
the boundary, use the boundary conditions (1.1), and derive the 
relations between the amplitudes of incident, reflected, and refracted 
waves. As a preliminary simple case, let us consider the case of 
normal incidence, when i and r are zero, so that the exponentials 
(2.2), (2.3), and (2.5) are independent of x. There will be no normal 
components of D and B, since the fields are transverse to the direction 
of propagation, which in this case is the z axis. In the incident wave, 
let E be along the x axis, H along the y axis. From Eq. (2.1) of 
Chap. VIII, we have 



Similarly for the refracted wave we assume E is along x, H 9,long y } 
and have 



For the reflected wave, either E or H must be reversed in phase with 
respect to the incident wave, since the wave travels in the opposite 
direction. Thus we have 


E 


ft 


Hy 



(3.3) 
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Our boundary conditions may then be stated with respect to E: 

4 

E, + B» = V m (E„ - m) = ^ E'„. 

Eliminating E' x , we find for the ratio of reflected to incident amplitude 

m _ VTh. - vg/g = zi - Zo 

E x -f- \/e'/fjf Zq + Zq w*4) 

The ratio of reflected H to incident H, except for the change of sign, 
is the same; thus the ratio of reflected power to incident power, or 
the reflection coefficient for power, is the square of the quantity given 
in (3.4). 

In most common cases, the media are nonmagnetic, and 

M = n' = fXQ, 

the value characteristic of free space. In this case, from (1.9), 
Chap. VIII, the square roots in (3.4) are proportional to the cor¬ 
responding indices of refraction, and the reflection coefficient for 
power is 

Reflection coefficient for power = ~ ~~ • (3.5) 

We note that, if n and n' are interchanged, or the incident wave is 
approaching the surface from the other side, the reflection coefficient 
is unchanged; a surface always reflects equally in both directions. 
The reflection coefficient is of course less than unity. We can easily 
find E' SJ the amplitude of the refracted wave, if we choose, and show 
that the power carried to the surface by the incident wave is split up 
between the reflected and refracted waves, so that, if any energy is in 
the refracted wave, the reflected wave must be weaker than the inci¬ 
dent, and the reflection coefficient must be less than unity. For 
instance, for a wave of light reflected at a surface between glass and 
air, we have n = 1 for air, and about 1.5 for glass; in this case the 
coefficient is (0.5) 2 /(2.5) 2 = i^ 5j showing that only 4 per cent of the 
intensity is reflected from a glass plate at normal incidence. 

4. Fresnel’s Equations.—Next we take the case of an arbitrary 
angle of incidence. Here we meet the question of polarization. The 
vector E is at right angles to the direction of propagation, but that 
does not fix the direction uniquely, and it is said that the wave is 
polarized in a particular direction if its electric vector (or, according 
to an alternative convention, its magnetic vector) points in that direc¬ 
tion. Let us then consider the two extreme cases. We take the wave 
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normal of the incident wave to be in the xz plane, as before. Then 
we consider the case where the electric vector is along the y axis, and 
the case where it is in the xz plane, as in Fig. 25. In case 1, the y axis 
points down into the paper, E and E' point down, and E" points up; in 
case 2, H, H', and H" all point down, along the y axis. We now discuss 

these cases separately. 




Fig. 25.—Vectors in reflection and refraction, 
paper. E and E' point down, E" points up. 


Case 1: y axis points down, into 
Case 2: H, H', H" all point down. 


the 


Case 1.—The electric vector is along the y axis, or at right angles 
to the plane of incidence. All vectors depend on space in the way 
indicated by (2.2) for the incident wave, (2.3) for the refracted wave, 
and (2.5) for the reflected wave. From the figure, we see that for 
the incident wave H x = -(. E y /Z 0 ) cos i, where Z 0 is given in (3.1). 
Similarly for the refracted wave, H' x — — (E' y /Z' Q ) cos r, and for the 
reflected wave H" - -(E' y '/Z 0 ) cos i. Hence we have the following 


relations: 


Tangential component of E: E v E y E y . 

COS 

Tangential component of ll : - (Ey + K) -J- 



cos r 
Z'o ‘ 


Remembering Snell’s law, the second may be rewritten 


Ey + E'J = E y 


jli tan 
H' tan r 


From this at once, multiplying the first by (M/V)(tan i/tan r), and 
subtracting, we have 


/ M tan i _ A E „ (jt 
\J7 ten r V " V 


tan r 

E ft 

y _ 

Tv ~ 


i 

> 

ii tan i 


tan i 


tan r 
■’ tan r 




tan i + y! tan r 
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If both media are nonmagnetic, so that y = /j.', as is usually the case, 
this becomes 


Ey __ tan i — tan r __ sin i cos r — cos i sin r 

E y tan i + tan r ~ sin i cos r + cos i sin r 

E'J _ sin (£ — r) 

E y sin (i -f* r) 


(4.1) 


This gives the amplitude of the reflected wave, and is one of Fresnel's 
equations. We note that, as i and r become zero, the law of refrac¬ 
tion becomes i/r = n'/n, i — (n'/n)r. Thus, in the limit of normal 
incidence, the ratio approaches ( n' — ri)/(n' + ri), as we found above 
(when we correct the signs for the different convention used here). 
We also note, in the other extreme of tangential or grazing incidence, 
that i = 90°, so that the ratio is [sin (90° — r)]/[sin (90° + r)] = 1. 
That is, the reflection coefficient equals unity for grazing incidence. 
The formula gives a monotonic increase of amplitude as the angle of 
incidence increases. 

Case 2.—The electric vector is in the xz plane, or in the plane of 
incidence. We let H be along the y axis in all waves, and let E } 
E', E" be the magnitudes of E in the three waves. Then H y — E/Z 0 , 
H y = E'/Zq , H f y f = E"/Z o. For the components of the electric vec¬ 
tors, we have E x = E cos i, E' x = E' cos r, E” = — E" cos i. Then 
we have 

/ 

Tangential component of E: {E — E") cos i — E' cos r 
Tangential component of H: — ~jl — ^ = ~ r 


We may rewrite the first as E — E" = E' (cos rf cos i), the second as, 
E -f- E" — (Z 0 /Z' 0 )E'. Multiplying the first by 


Zc 

zs 


io _ ( M ( n \ _ ( p\( sin A 
S V* / \ n ) \MV \sin r/ 

the second by cos r /cos i, and subtracting, we have 


E 


( J± si n i __ cos r \ _ ^ lff /ju sin i cos r \ 

fi' sin r cos i) \/x' sin r cos i / 


E 

~E 


cos 

At sin i cos i — ^ sin r cos r 
fjL sin i cos i + ix' sin r cos r 


If M = m', we can use the trigonometric relations 
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sin (z ± r) cos (z + r) 

= (sin z cos r ± cos i sin r) (cos z cos r ± sin z* sin r) 

= sin i cos z*(cos 2 r + sin 2 r) ± sin r cos r(sin 2 z + cos 2 z) 
= sin z* cos z ± sin r cos r. 

Hence we have 

F 7/ _ sin (z — r) cos (z H~ r) _ tan (z — r) . 

2? sin (z ■+• r) cos (z — r) ~ tan (i-f 0 \ m ) 

This is the other of Fresnel's equations. 

There is one interesting feature met in this case, which is not 
present when the electric vector is at right angles to the plane of 
incidence: if i + r — 90°, a perfectly possible situation, the denomi¬ 
nator of (4.2) is infinite, so that the reflection coefficient is zero. This 
angle is called the “polarizing angle”; if unpolarized radiation, 
consisting of a mixture of both types of radiation, falls on a surface 
at this angle, only the radiation with its electric vector at right angles 
to the plane of incidence will be reflected, and the reflected radiation 
will be polarized. It was by this phenomenon that polarized light 
was first discovered. Light was reflected from one mirror at this 
angle. Then its polarization was found by reflecting from a second 
mirror at the same angle. As the second mirror was rotated about 
the beam as an axis, so that the polarization changed from being at 
right angles to the plane of incidence to being in the plane, the doubly 
reflected beam changed from a maximum intensity to zero. The 
polarizing angle r' is fixed by %' + r' — 90°, and this occurs when 
cos z' = sin r'. Using the law of refraction, we find 

, n' 

tan z = —j 
n 

thus fixing the polarizing angle i '. 

5. Total Reflection. —For radiation passing from a dense medium 
to a rarer medium, so that n' < n, the angle of refraction becomes 
90° for an angle i for which sin i — n'/n. For greater angles of 
incidence, the equation sin r == (n/n') sin i would indicate that sin r 
should be greater than unity, so that r must be imaginary. To find 
the physical meaning of this situation, we compute cos r, which is 
given by 

cos r = + \/l — sin 2 r = ± j \/ (zz/n') 2 sin 2 i — 1. 

The expression (2.3) for the disturbance in the rarer medium then 
becomes _ 

gjo)[t — (jt sin —«*)[■%/(n/n') 2 sin 2 »— 11 z/v'^ 
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where we have used the negative square root. The first term, repre¬ 
sents a wave propagated along the x axis, or parallel to the surface 
of the medium, with an apparent velocity v'/sin r, a value less than v'. 
The second factor indicates that the amplitude of this wave is damped 
out as z increases, or as we go away from the surface, so that the wave 
fronts (surfaces of constant phase) are at right angles to the surfaces 
of constant amplitude. This disturbance ordinarily damps out in a 
very short distance. Thus if ( n/n f ) sin 2 i is decidedly greater than 1, 
the exponential becomes small when z is a few wave lengths («z/V 
a reasonably large number). Consequently the disturbance in the 
rare medium is not observed, unless special experiments are devised 
to find it. It is easily shown that Poynting’s vector for this wave 
has no component normal to the surface, so that it does not carry any 
energy away. Thus all the energy in the incident wave must reappear 
in the reflected wave, and for this reason the phenomenon is called 
“total reflection.” The angle of incidence given by sin i — n'/n, 
which must be exceeded in order to have total reflection, is called the 
“ critical angle.” 

Although there is no change of amplitude on reflection in this 
case, there is a change of phase, which is sometimes of interest, and 
which may be treated by Fresnel’s equations. Thus in case 1 we 
have, assuming nonmagnetic media, 

E" _ sin i cos r — cos i sin r _ -\/sin 2 i — (n'/n) 2 — j cos i 

E ~ sin i cos r + cos i sin r “ Vsin 2 i - (n'/n) 2 -b ~j cos i 

and in case 2, 

E^_ _ _ (n/n') 2 •%/sin 2 i — (n’/n) 2 — j cos i 

E ( nfn ') 2 s/ sin 2 i — (n'/n)* + j cos i 

In each case, E"/E is the ratio of a complex number to its complex 
conjugate, which is therefore a complex number of unit magnitude 
(showing that the reflection coefficient is unity), but with a certain 
phase angle. Thus, in the general case, where E has components both 
in the xz plane and along the y axis, there is a difference of phase 
between these components upon total reflection, and linearly polarized 
light in general will become elliptically polarized upon total reflection. 
To see this, we note that two vibrations at right angles, with the same 
frequency and phase, produce a resultant vector whose extremity 
moves in a line (plane polarization), but if the two components are in 
different phases the extremity of the vector traces out an ellipse. 
If the phases differ by 90°, and the amplitudes are equal, the polariza¬ 
tion is circular. 
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6. Damped Plane Waves, Normal Incidence.— So far, we have 
taken up only the case of nonabsorbing media, with real indices of 
refraction. The case of reflection by an absorbing medium, such as a 
metal, can be handled, as we saw in Chap. IX, by replacing the index 
of refraction n by the complex quantity n — jk. Considering the 
case of normal incidence, using (3.4), and setting fx = /x', we have 

E" _ n — (»' — jk) _ (n — n') + jk' . 

E, n + («' - jk') {n + n') - jk" 

on the assumption that the wave is incident on an absorbing medium 
( n',k ') from a nonabsorbing medium. Since this ratio is complex, 
we see in the first place that there is a phase change on reflection 
from an absorbing medium. To find the reflection coefficient R , we 
must take the square of the magnitude of the quantity in (6.1). This 
can be done by multiplying by the complex conjugate. We have 

7? — — VlH 

— (n + n') 2 -f k ' 2 

It is interesting to consider the behavior of a metal at relatively 
low frequencies. We saw in Chap. IX, Sec. 3, that at frequencies 
below the shorter part of the infrared, n' and k' both approach 'v/v/2w€ 0 . 
Furthermore, both these quantities are very large compared with 
unity, or with n, the index of refraction of the transparent medium in 
question. Substituting these values, the reflection coefficient becomes 

R = 1 - 2n (6.2) 

According to our assumptions, the second term is small; thus nearly 
all the incident radiation is reflected. Furthermore, the second term 
becomes smaller as the frequency is reduced, or as the conductivity 
increases. In these cases, very little power flows into the conductor, 
and is dissipated in it because of the damping of the wave, or, in 
physical language, because of the dissipation of heat in the resistance. 
The reason so little power flows into the metal is easily seen, from the 
relation (2.1) of Chap. VIII for the ratio of E to H. From that 
equation, in the limit of low frequency, we have 



if we assume that f x = m o- That is, the ratio of E to H is much smaller 
than the value y/ mo/«o characteristic of empty space. It is so small, 
in fact, that the E within the metal can almost be neglected. Thus 
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Poynting’s vector at the surface is very small, representing a small 
energy flow into the metal. The situation is almost like that with a 
perfect conductor, in which the electric vectors of the incident and 
reflected waves exactly cancel at the surface of the metal, whereas 
the magnetic vectors are equal in magnitude, because of the perfect 
reflection, and add. 

Looking back to the value of y for a metal, from Eq. (1.5) of 
Chap. VIII, we see that in our limiting case 

y = +ju */— = +j \/ — jcocr/x. 

This tells us in the first place that the real and imaginary parts are 
equal [since j y/—j = (1 + j)/\/ 2], so that a wave inside a good con¬ 
ductor damps down to a small fraction of its intensity in a few wave 
lengths; and that the distance in which the intensity falls off to a 
fraction of its value at the surface decreases as either the frequency or 
conductivity increases. It is often convenient to define the distance 
in which the amplitude falls to \/e of its value as the skin depth, for 
this damping of the wave inside the conductor is simply another way 
of describing the skin effect, familiar in high-frequency electric-circuit 
theory. We have for 6, the skin depth, 

5 - (o - 3) 

For the limiting case of a perfect conductor, we then see that the 
reflection coefficient is unity; the tangential E at the surface is zero 
(as it must be, for otherwise the current would have to be infinite), 
but the tangential H is finite; but that the field within the conductor 
is a damped wave that falls off infinitely rapidly to zero as we penetrate 
the metal, so that directly below the surface //, as well as E, is zero. 
In other words, to take account of the rapid decrease of the tangential 
H from a finite value at the surface to zero directly below the surface, 
Stokes s theorem shows that we must assume a surface-current density 
at the surface of the conductor, numerically equal to the tangential IT , 
but at right angles to it, which because of the perfect conductivity 
can flow without a corresponding tangential electric field. These 
limiting boundary conditions are often convenient to use directly in 
discussing the boundary conditions of an electromagnetic wave 
reflected by a perfect conductor, as for instance in considering propaga¬ 
tion in wave guides, and electromagnetic fields in resonant microwave 
cavities. 
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7. Damped Plane Waves, Oblique Incidence.—If a plane wave 
approaches the surface between a transparent medium and an absorb¬ 
ing medium, such as a metal, at oblique incidence,, the problem is 
considerably more complicated than any that we have taken up so far, 
and we shall not give a complete discussion. (A more complete treat¬ 
ment is given in J. C. Slater, Microwave Transmission, Sec. 13, McGraw- 
Hill Book Company, Inc., New York, 1943.) We can see the reason 
for the complication in a very simple way. The wave normal in the 
metal will be at an angle of refraction r, which certainly will not bring 
it along the z axis, or the normal to the surface. Thus the surfaces 
of constant phase, or the wave fronts, in the metal, will not be parallel 
to the planes z = constant. On the other hand, the amplitude is 
constant over the surface z — 0, the surface of the metal; thus it will 
also be constant at surfaces z = constant within the metal, since at all 
points of such a surface the wave will have penetrated equal distances 
through the absorbing medium, and will have had equal energy losses. 
Thus the surfaces of constant phase do not coincide with the surfaces of 
constant amplitude. 

We have already seen, in Sec. 5, an example of a case where these 
surfaces do not coincide; in that case, in fact, they were at right angles. 
We may get a general approach to this problem by assuming, in place 
of (1.4) of Chap. VIII, a general solution of the wave equation of the 
form where y and r are now vectors, and where y can be called 

the “propagation vector.” In Chap. VIII, we could have used this 
form, with the assumption that y pointed along the z axis. We saw 
in (1.6) of Chap. VIII that y could be complex; but now we see that an 
additional complication can arise, in that the real and imaginary parts 
of y, which we can call a and (3, can be vectors that do not have to have 
the same direction in space. If we substitute our expression in the 
wave equation, we find that, if there is no loss in the medium, a and 5 
must be at right angles to each other, and there is a definite relation 
between their magnitudes. This is the situation met in total reflec¬ 
tion, where we have a damped wave, but in a nonabsorbing medium. 
The physical situation is clear: since there is no loss, the intensity 
of the wave must be constant along the direction of propagation, but 
there is nothing to prevent its intensity varying at right angles to this 
direction. 

On the other hand, in an absorbing medium, the relation between 
the directions of « and (3 is quite arbitrary. Thus we can always 
choose the direction of a so as to make the surface of the metal a 
surface of constant amplitude, and yet can choose the direction of § 
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so a's to satisfy the law of refraction. We find, however, that the law 
of refraction is no longer the simple form of SnelPs law. The angle of 
refraction acts like a complex angle, and we can modify SnelPs law, if 
we choose, to give the correct results, by taking a complex index of 
refraction, and complex angle of refraction. The net result of the 
law of refraction, however, is very simple for a metal of good conductiv¬ 
ity. We have seen that its index of refraction n', as well as the absorp¬ 
tion coefficient 7c', are very large. The refracted wave within the 
metal resembles that in an ordinary case of refraction to the extent 
that, with a very large index of refraction, the angle of refraction is 
very small, or the wave normal of the refracted wave is nearly along 
the normal to the surface. In other words, quite independent of the 
angle of incidence, the wave inside the metal is similar to that dis- , 
cussed in Sec. 6. There is, however, a change of phase on reflection, 
which we can get from Fresnel’s equations, treating the angle of 
refraction as complex, which is a function of the angle of incidence, 
and which is different for the two planes of polarization of the incident 
wave. Thus it can happen that the state of polarization of the incident 
wave can be changed by reflection, in a characteristic manner, and 
this gives one of the practical experimental methods for finding 
information regarding the optical constants of a metal. 

Problems 

1. Light is reflected from glass of index of refraction 1.5. Compute and plot 
curves for the reflected intensity as a function of angle, for both sorts of plane 
polarization. 

2. Find the intensity of light in the refracted medium, for arbitrary angle of 
incidence and both types of polarization. Show that the amount of energy strik¬ 
ing the surface is just equal to the amount carried away from it. Note that the 
amount striking the surface is computed, not from the whole of Poyn ting's vector, 
but from its normal component. 

3. Light passes normally through a glass plate. Find the weakening in inten¬ 
sity because of the reflection at the faces. 

4. Ten plates of glass of index 1.5 are placed together and used as a polarizer. 
Light strikes the plates at the polarizing angle, and the transmitted light is used. 
Since all the reflected light is of one polarization, and the reflection at both surfaces 
of all plates is enough to remove practically all the light of this polarization, the 
transmitted light will be practically polarized in the other direction. Find tho 
intensity of both sorts of light in the transmitted beam, assuming initially unpolar¬ 
ized light, and hence show how much polarization is introduced. You rnay have to 
consider multiple internal reflection. 

5. The resistivity of copper is about 1.7 X 10 -ft ohm-cm. Calculate the reflec¬ 
tive power of copper for wave lengths of light X = 12/* and X = 25.5/*. The 
observed values of 1 — R are 1.6 per cent and 1.17 per cent at these wave lengths. 



CHAPTER XI 

WAVE GUIDES AND CAVITY RESONATORS 

If electromagnetic radiation is introduced into the open end of a 
hollow pipe bounded by perfectly conducting, and hence perfectly 
reflecting walls, the radiation will be reflected from the wall whenever 
it strikes it, and will be able to progress down the pipe for an indefinite 
distance. Such a hollow pipe is called a “wave guide,” and has recently 
come into much prominence in connection with the applications of 
microwaves, or electromagnetic waves whose lengths are of the order 
of magnitude of laboratory dimensions. The propagation of waves 
down a wave guide is not quite so simple as it seems at first sight; 
for the various reflected waves interfere with each other, in such a way 
that only certain types of waves can exist in the guide. In particular, 
the field pattern of the wave in a plane perpendicular to the axis can 
take on only one of a discrete, though infinite, number of character¬ 
istic patterns, called “modes,” determined by the shape and size of the 
cross section. Corresponding to each of these modes, a wave is 
possible in the guide for any given frequency, and the effective wave 
ength as measured along the guide, the so-called “guide wave length,” 
is determined in terms of the frequency. 

For each frequency and guide wave length we can at once deter¬ 
mine the phase velocity of propagation along the axis of the guide, and 
we find that in most cases this phase velocity is greater than the 
velocity of light, a fact that does not contradict the principle of rela¬ 
tivity, since it can be shown that no signal can be propagated with this 
phase velocity. The phase velocity varies with frequency; that is, the 
propagation shows dispersion, just like the propagation of light in a 
dispersive medium, which we discussed in Chap. IX. In fact, as the 
frequency decreases, the guide wave length, and the phase velocity, 
increase without limit, both becoming infinite at a quite finite fre¬ 
quency. At still lower frequencies, the disturbance, instead of being 
propagated along the wave guide, is exponentially attenuated or 
damped, at a more and more rapid rate as the frequency approaches 
zero. The frequency at which the disturbance changes over from 
being propagated to being attenuated is called the “cutoff frequency,” 
and the corresponding wave length of a wave in free space is called 
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the “cutoff wave length.” Since the guide will propagate only waves 
of f re e-space wave length shorter than the cutoff wave length, or of 
frequencies higher than the cutoff frequency, it forms a filter, passing 
only high frequencies. The cutoff frequency is different for each 
mode. Thus, for frequencies less than the lowest cutoff frequency, no 
wave can be propagated down the guide; between the lowest and next 
lowest cutoff frequencies, only one mode can be propagated; and so on. 
As a practical matter, wave guides are usually used in the region 
between the lowest and next lowest cutoff frequencies, so as to have 
only one mode propagated, the rest attenuated. For certain special 
types of wave guides, the lowest cutoff frequency is zero, so that in 
such cases any frequency, no matter how low, can be propagated. We 
shall find that in such a case this mode of zero cutoff frequency, called 
the “dominant mode,” has the interesting property that the velocity 
of propagation is equal to the velocity of light in free space, so that 
this mode, unlike the others, does not show dispersion. 

If a length of wave guide is closed by two perfectly reflecting 
planes, radiation propagated down the guide will be reflected back 
and forth by the planes, and will form standing waves. Furthermore, 
in order that the successive reflections may reinforce each other, 
it is clearly necessary that the round trip, from one plane to the 
other and back again, should occupy a whole number of periods of the 
vibration, or that this distance should be a whole number of guide 
wave lengths. Thus the guide wave lengths, and hence the fre¬ 
quencies, are determined in terms of the dimensions. Such a guide 
with closed ends forms a simple example of a cavity resonator. It is 
found, in fact, that any closed region bounded by perfectly reflecting 
walls can vibrate in certain normal modes, each of a definite frequency 
fixed by the geometry of the cavity, there being an infinite number of 
such normal modes of greater and greater frequency. Such resonant 
cavities form the equivalent, in the microwave range of frequencies, 
of resonant circuits in the ordinary range where electric circuits are 
used. It is obvious that wave guides with closed ends, forming 
resonant cavities, form a close analogy to organ pipes, and it is not 
surprising to find that Lord Rayleigh, some of whose best known work 
was in the theory of sound, worked out the theory of wave guides and 
of cavity resonators in the latter part of the nineteenth century. 
Their practical application, however, has waited until the last few 
years, when techniques for producing the oscillations of the required 
very high frequency have become available. 

The properties of wave guides, which we have outlined above, 
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seem at first sight strange and unexpected, though it is very easy 
to prove these properties by setting up appropriate solutions of Max¬ 
well’s equations. A little further consideration, however, shows 
that the properties of wave guides follow largely from the discussion 
of reflection which we have already given. Even the attenuated waves 
for frequencies below cutoff prove to have their simple explanation, 
and to be closely related to the attenuated wave in the rarer medium 
in the problem of total internal reflection, the cutoff phenomenon 
having an analogy to the critical angle. Accordingly, so as to make 
the subject physically clear, we shall start our discussion by consider¬ 
ing a very simple problem, the propagation of radiation between two 
parallel plane mirrors, which shows many of the properties met with 
wave guides. We shall not push this method too far, however, for 
in fact it is only with rectangular wave guides that one can build 
up the solution from a study of reflection by plane walls. In more 
complicated cases, such as the circular guide, this method is not 
available, and yet we can set up a general treatment that takes care 
of any arbitrary cross section in a fairly simple way. 

1. Propagation between Two Parallel Mirrors.—A perfect reflector 
allows no field to penetrate within it; we have seen how this is to 
be realized in fact, by considering a good conductor, in which the 
field penetrates only to a distance comparable with the skin depth, 
which reduces to zero in the limit of perfect conductivity, which we 
assume. Thus the electric and magnetic fields are zero within the 
mirror, and our boundary conditions reduce to two statements: first, 
that the tangential component of E at the surface of the mirror is 
zero, since it must be zero within the mirror, and we know that the 
tangential component is continuous; secondly, the normal component 
of B at the surface must be zero, since B is zero within the mirror, and 
the normal component of B is continuous. The other two conditions, 
on the normal component of D and the tangential component of H, 
lead to information about the surface charge and current; for our 
surface is conducting and can carry charge and current, so that the 
value of the normal component of D outside the metal measures the 
surface-charge density, and the tangential component of H the surface- 
current density, which appear on the surface. Because of the lack of 
penetration of the field into the mirror, we need consider only incident 
waves striking the mirror, and corresponding reflected waves, but no 
refracted waves within the mirror. 

With this understanding of the nature of the reflection process, 
let us consider a wave striking a perfectly reflecting mirror, and the 
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corresponding reflected wave. Let the mirror be in the xz plane, the 
plane y = 0, and let the incident wave have its normal in the yz plane, 
making an angle of incidence i with the y axis, as shown in Fig. 26. 

Let the incident wave approach from posi¬ 
tive values of y, and negative z’ s. Then, 
using the same sort of arguments as in 
Chap. X, the exponential factor represent¬ 
ing wave propagation in the incident wave 
will be 

gj'wjt— [(—v cos i+z»ini)/c\) (1-1) 

and the corresponding factor in the reflected 
wave will be 

—[(2/ coh i+a sin t)/c]J ^ (1«2) 

We must now superpose these in such 
a way that certain components of the field 
(the tangential component of E, and the 
normal component of B) will be zero on the 
plane y = 0. The only way in which we 
can accomplish this is to superpose (1.1) 
and (1.2) in such a way that the function 
of y turns into a sine function. We do this 
by subtracting (1.2) from (1.1), and divid- 



V 


Fig. 26 . —Coordinate sys¬ 
tem for propagation between 
parallel planes. 


ing by 2 j. Thus we obtain the function 

( 03 

3in I— 


(a sin i)/c] gjjj 


cos i\ 

—)y- 


(1.3) 


The solution (1.3) may be interpreted as a disturbance that is propa¬ 
gated along the z axis with an apparent velocity c/sin i, which is 
greater than c, since sin i is less than unity; and varying sinusoidally 
along y , or corresponding to standing waves along y, with an effective 
wave length given by 


2tt _ 03 cos i 

X<j c 



cos % 


(1.4) 


where \ e is the effective wave length along y, Xo = 27rc/w is the free- 
space wave length, or wave length of the disturbance in the absence 
of reflections. The standing-wave 'phenomenon along y can be 
observed with visible light, in the form of the so-called “Lippmann 
fringes.” Lippmann showed that, if standing waves are set up by this 
process, not in free space, but in the sensitive emulsion of a photo- 
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graphic plate, the emulsion will be blackened along surfaces half a 
wave length apart in the standing-wave pattern. The phenomenon 
can also be demonstrated very easily, with the longer waves of the 
microwave region. 

We have so far considered only the reflection by one mirror, at 
y = 0. In case, however, we, have a second mirror, at y — L, we 
must satisfy the same sort of boundary conditions at this second 
mirror. Thus our function (1.3) must be zero, not only at y — 0, 
but also at y = L. This leads at once to the condition 

L = ^, (1.5) 


where n is an integer. In other words, the dependence of the dis¬ 
turbance on y cannot have any arbitrary wave length, but only an 
infinite set of discrete wave lengths. These form the modes that 
we have already mentioned. It should be noted that, to discuss this 
problem of two mirrors, we do not have to think of multiple reflection, 
in the elementary sense. Our one wave traveling along +y, and the 
other traveling along —y, suffice to satisfy the boundary conditions 
everywhere. If we treat the disturbance by means of rays, we should 
say that any given ray is reflected back and forth an infinite number 
of times from the two mirrors, but in fact all the rays traveling to 
the right coalesce to furnish the description of the single wave traveling 
to the right, and similarly with the rays traveling to the left. 

For a given n, or a given mode, the wave length X c is determined 
by (1.5). This is independent of the free-space wave length X 0 , or the 
corresponding frequency a>. Knowing Xo, from the conditions of our 
problem, we can next use (1.4) to find i, the angle of incidence, and 
from this we can find the apparent phase velocity v„ along the z direc¬ 


tion, determined by 


v 0 


c 

sin i 


( 1 . 6 ) 


and the corresponding wave length, 



Xo 

, , ■ p . « 

sin i 


(1.7) 


For many purposes it is convenient to eliminate i, the angle of inci¬ 
dence. Computing cos i from (1.4), sin i from (1.7), and taking the 
sum of their squares, we find at once the relation 

1.11 /1 o\ 
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This simple equation determines the wave length X„ along the z direc¬ 
tion, in terms of the free-space wave length X 0 , and the wave length X 0 
along the y direction, or the direction normal to z. We notice one 
interesting fact from (1.8). If X c is fixed, and if X 0 is increased, or 
the frequency is decreased, 1/XJ decreases, or increases, with conse¬ 
quent increase of -the phase velocity until finally when Xo equals 
X c , \ g and v g become infinite. For values of X 0 greater than X c , the 
wave length \ 0 and the velocity v g become imaginary, so that the wave 
is attenuated rather than propagated along the z axis. In other words, 
yre have the phenomena that we have already described, including a 
cutoff wave length, with only attenuation for longer waves than the 
cutoff wave length. Furthermore, we see that X c is the cutoff wave 
length in question. 

We have so far considered only the exponential function associ¬ 
ated with the direct and reflected waves, and the corresponding rela¬ 
tions regarding the wave length and frequency. In addition, we 
should take up the orientation and magnitude of the electric and ' 
magnetic vectors. As in Chap. X, there are two cases: that in which 
E is along the z direction, or at right angles to z for both the direct 
and reflected waves, and that in which H is in the x direction. In the 
language of wave guides, these are known, respectively, as the trans¬ 
verse electric (abbreviated TE) and transverse magnetic (abbreviated 
TM) modes of propagation. All wave guides, we shall find, have 
modes of both types. We could go through the equivalent of Fresnel’s 
equations, and find the relations between E and H in these two types 
of modes, but it is so easy to prove much more general results holding 
for any type of guide, as we shall do in the next section, that we shall 
not carry out this discussion here. 

Our problem of propagation between two mirrors forms a very 
simple and yet instructive case of wave-guide propagation. If we 
add two more reflecting surfaces perpendicular to the x axis, we have 
a rectangular wave guide. Some of the modes encountered with the 
mirrors, namely, those in which E is along the x axis, satisfy the 
boundary conditions for the rectangular guide as well, and in fact 
form important modes of this guide. They are not the whole set of 
modes, however, for there can be a sinusoidal variation along x as 
well as along y. If we furthermore add two reflecting surfaces per¬ 
pendicular to 2 , we form a totally enclosed rectangular cavity. We 
now cannot use the solution we have so far written, involving propa¬ 
gation along z; we must rather superpose a reflected wave traveling 
along — 2 , giving a standing wave along z. To satisfy the boundary 
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conditions at the mirrors, which may be located for example at z — 0, 
z = 'M, we must make the sinusoidal function along z , whose wave 
length is \ 0 , satisfy a condition analogous to (1.5), or 

M = (1.9) 

where m is an integer. Combining (1.5), (1.9), and (1.8), this leads to 

(§m) + (l z) = xf (110) 

That is, the free-space wave length, or the frequency, of the disturb¬ 
ance in the cavity is determined by the integers n and m. We have 
a discrete set of normal modes of oscillation, in which the cavity can 
resonate. 

The problem is similar to that of the elastic vibrations of a coupled 
system or of a vibrating membrane in mechanics, in which there are a 
discrete set of normal modes of oscillation, each with its own resonant 
frequency, and characteristic wave form. Furthermore, as in mechan¬ 
ics, we can introduce normal coordinates describing the various modes, 
using these normal coordinates to describe forced oscillations of the 
system, and we can prove orthogonality relations holding between 
the various normal vibrations. These subjects are, however, beyond 
the scope of the present book. We now proceed to a much more 
general discussion of propagation in wave guides, holding for a wave 
guide of arbitrary cross section, and reducing to the results of the 
present section in our present simple case. This general discussion is 
simpler and more powerful, but it does not exhibit the way in which 
the field is made up of the superposition of direct and reflected waves. 

2. Electromagnetic Field in the Wave Guide.—We shall assume 
a wave guide in the form of a hollow pipe of arbitrary cross section 
in the xy plane, extending indefinitely along z. As shown in Eqs. (1.2), 
Chap. VIII, each of the components of E and H must satisfy the wave 
equation, which, in empty space, is 

V’E- = V 2 H - = 0. (2-1) 

We now ask if we can obtain a solution of the form u(x,y)e J ^ ut ~^ 2vil/X ^j 
where u(x,y) is a function of x and y, and where X,„ the guide wave 
length, has the same significance as in Sec. 1. Substituting in (2.1), 
and using X 0 = 2xc/co, we find that u satisfies the equation 
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where X c is defined in terms of \ g and \o by (1.8). Equation (2.2) 
must be solved subject to certain boundary conditions, which we shall 
discuss presently. Subject to these boundary conditions, we find 
solutions for only a discrete set of values X c . From (1.8) we are 
then led, as in the preceding section, to the conclusion that X c forms 
a cutoff wave length, such that free-space wave lengths greater than 
X e are attenuated in the mode in question, while shorter free-space 
wave lengths are propagated. Thus those conclusions are quite 
general, and are not limited to the case of propagation between parallel 
reflecting planes. 

The fields E and H must satisfy not only the wave equation, but 
Maxwell's equations as well. When we write these equations down, 
we find at once, as mentioned in the preceding section, that two 
separate types of solutions are possible: solutions for which E z = 0, 
the transverse electric, or TE , waves, and solutions for which H z — 0, 
the transverse magnetic, or TM , waves. We shall let E f , H f be the 
transverse components of E and H (that is, E* = i E x + j E y , and so on). 
Letting k be Unit vector along the z axis, as usual, we then find easily 
and directly from Maxwell's equations the following relations: 

TE: grad H, = 2 irj H„ 

TM: grad E, = 2-rj E,. (2.3) 

In these expressions, H s , and E z represent that part of the correspond¬ 
ing quantity which must be multiplied by the exponential $/[«<-< 
to give the complete components. Since they are functions of x and y 
only, their gradients are in the xy plane, as is proper for H* or Ef. 
We find that there is a relationship between the transverse components 
of E and H, as follows: 

H* = where Z Q = A M° ^ for TE 

Aa \e 0 X 0 

- \fe i for ™- w 

This means, since k and Ei are at right angles to each other, that H* 
is at right angles to Ei in the xy plane, and is equal in magnitude to 
the magnitude of E f , divided by the quantity Z Q , which is clearly 
analogous to the corresponding quantity introduced in Eq. (2.1), 
Chap. VIII. 

Using (2.3) and (2.4)/ we can find all the components of E and H 
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from H z (in the TE case) or from E g (in the TM case). These quanti¬ 
ties, like the transverse components, satisfy the wave equation (2.2), 
and are scalar solutions of that differential equation. Furthermore, 
E z is zero on the boundary of the guide (since E must have no tangential 
component on the surface), while H z has a vanishing normal deriva¬ 
tive on the boundary (since H must have no normal component on the 
surface, and Hi is proportional to the gradient of H z ). We may draw 
lines of H z = constant (in the TE case) or of E z = constant (in the 
TM case) in the xy plane. Then by (2.3) the orthogonal trajectories 
of these lines will be along the direction of Hi (in the TE case) or Of E { 
(in the TM case). Finally, since by (2.4) the direction of Hi is 
perpendicular to that of E { , the lines of constant H z will be along the 
direction of Ei (in the TE case), and the lines of constant E z will be 
along the direction of Hi (in the TM case). Proceeding in this way, 
we may draw lines of force, for the transverse components of E and H, 
in the xy plane, finding of course that the electric lines of force meet 
the surfaces at right angles, while the magnetic lines of force are 
tangential to the surface. 

For every scalar solution of the two-dimensional wave equation 
satisfying the condition that it vanishes on the boundary, we get a 
TM wave, and for every solution whose normal derivative vanishes 
we get a TE wave, as we have seen above. There will be an infinite 
number of solutions of each type, each corresponding to a particular 
cutoff wave length. These wave lengths may be arranged in order 
of decreasing magnitude; they start with a largest cutoff wave length, 
associated with the lowest mode of oscillation, and extend indefinitely 
toward shorter and shorter wave lengths, so that we have an infinite 
number of modes of oscillation. We shall give in the next section 
some simple examples of the modes in various types of wave guides. 

In some cases, the first mode has an infinite cutoff wave length; 
in this case wo call it a “principal mode.” When a mode of infinite 
cutoff wave length, or principal mode, exists, it has great practical 
importance, because it can be used to propagate any wave length, no 
matter how long. The commonly used mode of the coaxial line is a 
principal mode, and the familiar parallel-wire transmission line, ordi¬ 
narily used for low frequencies, can also be considered as a wave guide 
with a principal mode. We find that such a principal mode exists 
only if the wall of the wave guide consists of at least two separated 
conductors, as for instance in the coaxial line. The physical reason 
for this is quite clear: we can put a very low frequency, or direct 
current, into a transmission line consisting of two or more conductors, 
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and they will be insulated from each other, and suited to conduct the 
current. If there is only one conductor, however, as in an ordinary 
hollow pipe, there would clearly be a short circuit for a low frequency 
or direct current, and no propagation is possible until we get to a wave 
length short enough so that something like real wave propagation 
occurs. For a principal mode, the cutoff wave length is infinite, so 
that (2.3) tells us that grad H Z] or grad E z , must be zero. That is, the 
longitudinal components of both E and H are zero, and such a wave is 
simultaneously transverse electric and transverse magnetic. It is 
sometimes called a “transverse electromagnetic ( TEM ) wave” for 
this reason. Furthermore for such a wave, as we see from (1.8), the 
guide wave length becomes equal to the free-space wave length, so 
that the velocity of propagation becomes c. Fin ally, from (2.4), the 
quantity Z 0 for a principal wave becomes VmoAo, which by Eq. (2.1), 
Chap. VIII, is the ratio of magnitudes of E and H in a plane wave in 
free space. Thus a principal wave has many of the properties of a 
wave in free space. 

3. Examples of Wave Guides.—The commonest type of wave guide 
is that of rectangular cross section, bounded by planes at x = 0, x — a, 
y = 0, y = b. The solution of the wave equation (2.2) in this case 
can be carried out in rectangular coordinates, and is at once obvious. 
Thus for a TE wave we have 


where 


H z 


rmrx 

cos-cos 

a 
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(3J) 
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and where m, n are integers. Using (2.3), (2.4), we then have for the 
other components 
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(3.3) 


Of this infinite set of possible TE solutions, only one is commonly 
used in practice, in a rectangular guide. If a > b, this is the mode 
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of longest cutoff wave length, given by m = l, n = 0. We note from 
(3.3) that for this mode E x = 0, and E y is proportional to sin ttx/cl, 
going to a maximum at the center of the guide, and is independent 
of y, so that the lines of force run straight across the guide. There is 
no mode corresponding to m = 0, n = 0, for then as we see from (3.3) 
all the transverse components of field are zero, and we verify easily 
that the field cannot exist. Thus there is no principal mode for the 
rectangular guide. 

For the TM waves, we have 


E z 


. rriTrx 

sin- 

a 


sin &(■«>*— 2 xs/x„) 
o 


(3.4) 


where the cutoff wave length is again given by (3.2). We thus -see 
that, for the case of the rectangular guide, the TE and TM waves have 
the same cutoff wave lengths; this is a special case, which does not 
hold for most shapes of guides. Clearly we must have m and n 
both equal to or greater than unity, however, in the case of the TM 
waves, as we see from (3.4), so that the case m = 1, n = 0, which 
provides the dominant TE wave, does not exist for the TM waves. 
From (3.4) we can find the other components of E and H, as we did for 
the TE waves in (3.3). 

The next most important example of wave guides after the rec¬ 
tangular guide is that in which the bounding surfaces are circular 
cylinders. This includes both the circular guide, and the coaxial 
line, which has two concentric cylinders, with the wave propagated in 
the annular space between. For either of these cases, we must handle 
Maxwell’s equations and the wave equation in polar coordinates. In 
Appendix IY, where we discuss vector operations in curvilinear coordi¬ 
nates, we see that the wave equation, equivalent to (2.2), is 


11 / r M 

r dr \ dr ) 


, 1_ dht, 

r 2 dd 2 



(3.5) 


This equation can be solved by separation of variables. We assume 
that u can be written as a product of a function of r, and a function 
of 0: u = R(r)0(6). Substituting, dividing by u, and multiplying by 
r 2 , (3.5) takes the form 


Ii 


Id 
r dr 


(-f) + (£)’*]+! 


d 2 0 
dd 2 


0. 


(3.6) 


The first term of (3.6) is a function of r alone, the second a function 
of 6 alone; hence by the usual argument used in separation of variables, 
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each term must be a constant. Let us assume then that 
1 d 2 © ^ 2 © 

© d6* ” "dO*" n ^ ~ © — sin nO or cos nQ. (3.7) 

Since u must be a single-valued function of position, it is clear that 
increasing 6 by must bring 0 back to its original. value; thus the 
quantity n must be an integer. Substituting the value from (3.7) 
back in (3.6), and multiplying by R/r 2 , we then have as the equation 
for R 


+ [(£)' -£]*-»• <»> 

Equation (3.8) is Bessel's equation, whose properties are dis¬ 
cussed in Appendix VII. It has two independent solutions, called 
Bessel's function and Neumann's function: 

R = J - (I?) ° r N * (tt)- ( 3 - 9 ) 

These functions have the following properties, discussed in Appendix 
VII: At x = 0, J n (x) is proportional to x n , whereas N n (x ) becomes 
infinite. For large values of x, J n (x ) and N n (x) approach the values 

Jn{x) yjE cos (* _ *)’ 

N„(x) -* sin - 2n ^~ 1 (3.10) 

From this, it is clear that, in a circular guide, where the field must 
be finite at r = 0, we must use only the Bessel functions J n . On the 
other hand, with a coaxial line, in which r = 0 does not fall within 
the region where we are finding the field, the Neumann as well as the 
Bessel functions can, and in fact must, be used. 

We see, then, that the function u which satisfies the wave equation 
(3.5) must be the form 

u — cos (nd — a) 4- BN n cos (w0 — b) J. (3.11) 

Here we have introduced amplitude constants A and B, and phase 
constants a and 6. For a TE wave, as we have seen before, u stands 
for H z , and its normal derivative must be zero at the boundary of the 
guide; that is, the derivative of the function of r with respect to r 
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must be zero. For a TM wave, u stands for E z , which must itself 
be zero on the boundary. Now let us consider first a circular wave 
guide, for which we need only the Bessel function J*„. If R\ is the 
radius of the guide, we must clearly have J w (27tjRi/X c ) = 0 for a TM 
mode, and J' n (2nrRi /\ c ) = 0 for a TE mode, where the prime indicates 
a derivative. 

By giving a table of the maxima, minima, and zeros of the Bessel’s 
functions, we can then find the values of X« for the various modes. 
The first few values are given in the following table: 


Maxima and Roots of Bessel’s Functions 


J'n(x nm ) = 0(TE) 

#ui — 3.832 

Xu = 1.842 

#21 =3.05 


#02 — 7.016 

X\2 — 5,330 

#22 =6.71 

Jn(x nm ) = 0 (TM) 

#oi = 2.405 

X\ i ~ 3.332 

X 21 === 5.135 


# 02 = 5.520 

X 12 — 7.016 

x%2 = 8.417 


If we let x nm be the mth value of x for which J' n {x ) — 0 (for the TE 
modes) or for which J n (#) — 0 (for the TM modes), we then clearly 
have the cutoff wave length given by 


\ c = ML 1 , (3.1.2) 

Xnm 

where the o;„ m ’s are given in the table above. Thus we can calculate 
the cutoff wave lengths of the various modes. Using (3.11), and the 
relations (2.3) and (2.4), we can then find the various components of 
the field, where we remember that the gradient must be computed in 
polar coordinates, as discussed in Appendix IV. 

For a coaxial line, we must satisfy boundary conditions, not at 
one value of r, but at two. This can be done only by combining the 
Bessel and Neumann functions, and using the extra arbitrary constant 
gained thereby. Thus, if the radius of the inner conductor is Ri, 
and of the outer one is /2 2 , we may combine the Bessel and Neumann 
functions in such a way as to satisfy the boundary conditions at R 
From (3.10) it is clear that a suitable combination of these two func¬ 
tions will act like a cosine or sine function with arbitrary phase, which 
can be chosen to put the zero, or the maximum, at any desired point. 
This can be done for arbitrary X c . Then we can choose X c so as to 
satisfy the boundary condition at R 2 as well. This process is a little 
involved, and it is not so easy to get the numerical answers to problems 
as with the circular guide, where we found the cutoff wave lengths 
explicitly in (3.12). The main interest in the coaxial line, however, 
is not in the modes that we investigate in this way, but in the principal 
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mode, whose cutoff wave length is infinite. This can be approached 
by a limiting process from the solution in terms of Bessel and Neumann 
functions, but it is much easier to discuss it directly from the wave 
equation (2.2). 

For a principal mode, where X c is infinite, the equation (2.2) reduces 
to Laplace’s equation d 2 u/dx 2 + d^u/dy 11 — 0. Thus the problem of 
finding u becomes mathematically equivalent to that of finding an 
electrostatic potential in a two-dimensional problem. In our case of 
circular symmetry, we may take u to be In r, so that its gradient points 
in the direction of r, and is a constant divided by r . With a coaxial 
line, this gradient may be chosen as E, with H at right angles to it, and 
we have satisfied our boundary conditions at the surfaces of the con¬ 
ductors. Thus we have, for the principal mode of a coaxial line, the 
very simple solutions 

Er = - ««»*-»"/*.> He = J—Er. (3.13) 

r \Mo 

The wave, like all principal modes, is propagated with the velocity of 
light. The mode, as in all principal modes, is TEM, both E and H 
being transverse. This is not the only case of principal modes that 
we can work out simply. For instance, for a parallel-wire transmission 
line, we may use our earlier solution for the capacity of two parallel 
cylindrical conductors, worked out in Sec. 2, Chap. II. There we 
found the potential of two such conductors, a solution of Laplace’s 
equation, reducing to a constant on the surface of each conductor. 
The gradient of this potential gave the electrostatic field, cutting 
each conductor at right angles. By the principles we have just 
stated, this same field is proportional to the value of E in the principal 
mode of the parallel-wire transmission line, and H is at right angles 
to it, so that the equipotentials are also the lines of magnetic force. 

4 4. Standing Waves in Wave Guides.—We have so far considered a 
wave propagated along the z direction in an infinitely long wave 
guide, and varying according to an exponential e y(tJi_2,rvXo) . Now, 
however, we consider the possibility that the wave may be reflected 
from an obstacle at the end, sending back a reflected wave. This 
reflecting obstacle may be a perfect reflector, such as a metallic 
wall; then the reflected wave will be equal in amplitude to the incident 
wave, and there will be standing waves in which there are nodes 
and antinodes, the amplitudes canceling and resulting in zero dis¬ 
turbance at the nodes, with addition of the amplitudes midway 
between. On the other hand, the obstacle may be only a partial 



Skc, 4] 


WAVE GUIDES AND CAVITY RESONATORS 


143 


reflector, allowing some radiation to penetrate it, reflecting the rest. 
In this case, the reflected wave will have less intensity than the incident 
wave, and the interference of the two w r ill never completely cancel the 
disturbance. 

If the amplitude of the transverse electric field in the reflected 
wave is r times as great as in the incident wave, where r is sometimes 
called the “reflection coefficient,” the amplitude of the resulting dis¬ 
turbance will vary from something proportional to 1 + r, where the 
two waves add, to something proportional to 1 — r, where they oppose 
each other. The ratio of maximum to minimum E, or (1 + r)/(l — r), 
is called the “standing-wave ratio” in voltage. We see that it can 
go from unity, when r — 0, and there is no reflection, to infinity, when 
r — 1, and there is perfect reflection. The points along the guide 
where the resultant transverse electric field is a maximum are called 
“standing-wave maxima,” and those where it is a minimum are 
“standing-wave minima.” For an infinite standing-wave ratio the 
standing-wave minima become the nodes, and in any case they come 
a half wave length apart. It is clear that from a measurement of 
the magnitude of the standing-Wave ratio we can find the reflection 
coefficient r, and from the positions of the standing-wave minima we 
can find the phase change on reflection. 

There is one simple case in which we can calculate the reflection 
coefficient, easily. That is the case in which part of the guide, say 
for positive z, is filled with a dielectric, while the rest, say for negative 
2 , is empty. Then a wave approaching the surface of separation from 
negative z’s will be partly reflected, partly transmitted, just as a wave 
approaching a surface of glass from air is partly reflected, as we have 
seen in Chap. X. In simple cases, we could use Fresnel’s equations, as 
derived in that chapter, to find the reflection coefficient in the present 
case, but we shall instead proceed directly, showing that the results 
are in close analogy to those of Chap. X. First we must, consider the 
nature of the held in the part of the guide filled with the dielectric. 
If we follow through the derivation of Sec. 2, we find that the only 
change introduced by filling the space with a dielectric is to change 
€o to €. This has the effect first of changing the velocity of propagation. 
Thus the froe-space wave length X 0 is equal to \/n times the value for 
empty space, where n is the index of refraction. For a given cutoff 
wave length, (1.8) then shows that X„ will be smaller than in free 
space, though not in the same ratio as Xo. In Fq. (2.2), the wave 
equation for u, we find that there is no change introduced as a result 
of the dielectric. Thus the problem of finding cutoff wave lengths 
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and satisfying boundary conditions at the boundary of the guide is as 
before. In (2.3), the only change in the process of determining the 
transverse from the longitudinal components of field comes in the 
changed value of \ g , which changes the scale of the transverse com¬ 
ponents, but not the directions of the vectors. Similarly in (2.4) the 
only change in the relation between transverse E and H comes from 
the change in the quantity Z Q} which is different both because of the 
direct change of e and because of the change in \ g . 

Now let us consider a direct and a reflected wave for negative z, 
and a transmitted wave for positive z. If E£ represents the tangential 
electric field in its dependence on x and y, we may write the tangential 
electric and magnetic fields in the first medium as 


=== —2‘JTZ/X C r) J 




—2 71*2/ Afl) 


Ygj ( 0 )t+ 2 irz/\ a ) 1 


(4.1) 


Here the minus sign in the expression for the transverse magnetic 
field in the reflected wave is necessary so that the Poynting vector 
of the reflected wave will point along —z. Now at z = 0, at the 
boundary of the second medium, we have from (4.1) 


■cr k X Ef 1 — r 

— zv~ r+T 


(4.2) 


But, because of the continuity of the tangential components of E and 
H, the values of Hi and E t in (4.2) are also the values found in the 
second medium. On the other hand, by the modification of (2.4) 
appropriate for the second medium, we must also have 


Hl = * L irr‘> ( 4 - 3 > 

where Z' 0 is the value for the second medium. Equating (4.2) and 
(4.3), wo may solve for r, finding 


_ Zl - Z o 

Z'o + Zo 


(4.4) 


In (4.4) we have solved the problem of finding the reflection coefficient 
for the electric field, at a boundary between two media in a wave 
guide. The analogy of this to Eq. (3.4), Chap. X, is obvious, though 
the meaning of the symbols is somewhat different. Proceeding in 
the other direction, we have 


1 + r Zl 
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Thus we see that a measurement of reflection coefficient allows us to 
find the quantity Z$ characteristic of the second mediym. 

Often in actual practice there are other sorts of discontinuities 
in wave guides in addition to a simple change of index of refraction. 
For instance, there may be a diaphragm in the guide, with a hole 
of some arbitrary shape in it. At such an obstacle, part of the wave 
will be reflected, part transmitted. We can then set up a reflection 
coefficient, just as we have done in the case we have discussed, and 
can get the properties of the reflection by measuring the standing- 
wave ratio and the position of the standing-wave minima . In such 
cases it is often convenient to define a quantity like the Z' 0 of (4.5), 
characterizing the obstacle. We have already pointed out, in Sec. 2, 
Chap. VIII, the analogy between quantities like Z 0 and an impedance; 
in this way we are allowed to assign certain impedances to obstacles in 
wave guides, which prove to be pure reactances in case they introduce 
no losses. A further discussion of such reactances is beyond the scope 
of this text. 

5. Resonant Cavities.—We have already pointed out that, by ter¬ 
minating a guide with two perfectly reflecting walls, we form a resonant 
cavity. At each such wall, say at z — 0, z = M, the field will be 
perfectly reflected, so that there will be a standing-wave pattern set 
up within the cavity, with infinite standing-wave ratio. To satisfy 
boundary conditions at both walls, we must have the' tangential 
electric field zero at each, and this can be accomplished only if there 
is an integral number of half wave lengths in the length M. Thus 
we have the condition 



where m is an integer. 


Putting this into Eq. (1.8), we then have 



(5.2) 


That is, the free-space wave length, and frequency, are definitely 
determined for a cavity resonance. For each cutoff wave length of 
the guide, we shall have an infinite number of resonant modes of the 
resonant cavity; so that, since in general there will be a double infinity 
of cutoff wave lengths, there will be a triple infinity of resonant modes 
for the cavity. We can, of course, get formulas for the resonant 
frequencies in any case in which we can compute the cutoff wave 
lengths, as in the cases of rectangular and circular guides which we 
have already discussed. 
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The resonant cavities formed in this way, by putting reflecting 
walls at the ends of a guide, form but a special case of the general 
problem of resonant cavities. Resonant modes of oscillation can 
exist in any cavity bounded by perfectly reflecting walls, no matter 
what its shape. In some cases we can solve for the fields (as for 
instance in the spherical cavity, which we shall mention in the next 
chapter), but in the great majority of peases there is no practical 
way to solve the wave equation. We cannot in general reduce tlie 
solution of Maxwell’s equations to the solution of a scalar wave 
equation like (2.2), as we have been able to do for the guide; we must 
find vector solutions from the outset. The problem of discussing 
fields inside an arbitrary resonant cavity is then a complicated one, 
but nevertheless we can prove certain general theorems about them, 
even without being able to solve for them analytically. These 
theorems are, however, too complicated for us to take up here. 


Problems 

1. A wave travels along an air-filled coaxial line having an inner conductor of 
radius a and a sheath of inner radius 6, both of negligible resistance, the peak 
voltage and current of this wave being V and I. Using cylindrical coordinates, 
with the z axis coincident with the axis of the line, show that the electric and 
magnetic fields are given by 

0*) 7gj'(«i-0aO 

^ r r In 6/a ’ ^° 2 irr 

What value must /3 have? Find the characteristic impedance V/I and the wave 
impedance E r /Ho in terms of «o, /xo, a, and 6. 

2. Starting with Eq. (3.4) for the longitudinal electric-field component, derive 
expressions for the transverse components of E and H for TM waves in a rec¬ 
tangular wave guide. 

3. For the case of TE and TM modes in a rectangular wave guide, compute 
Poynting's vector, its average value, and from this the power transmitted by these 
modes. For the lowest TE modes in a rectangular wave guide, find an expression 
for the maximum power that can be transmitted if the electric field cannot exceed 
a value Eo without sparking. 

4. Find expressions for the field components for TE and TM waves in a circular 
wave guide. Discuss the degeneracies present in these modes, especially those 
resulting from the relation J[ (x) = —Ji(x). 

5. Show by direct substitution in Maxwell’s equations that the following field 
for a TM wave is a solution of these equations: 


H x - 

E z — T 


-*L E„ - 

f}a>H 

up '\/k 2 — / 9 2 tr 
TTjj ■"*» 


— jcajjur. Use this solution to discuss the propagation of a surface 


where k 2 — « a eju 
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wave along a metal surface. Let the metal fill the space y < 0, with air for y > 0. 
Write the appropriate solutions in both regions, and show from the boundary 
conditions that is given by 1//S 2 *» l/k 2 + 1 /k%, with &§ =» cu*«ju. Now assume 
that the conductivity a of the metal is high so that \k 2 \ » Compute the total 
current in the metal per unit width in the x direction, and show that it is equal to 
the magnetic field at the metal-air surface, just as if one had a perfect conductor. 
From the y component of the Poynt.ing vector, compute the power flow into the 
metal at the boundary y «* 0, and show that it is of the form I 2 R, with I the rms 
total eurrent in the metal. Show further that this resistance R is the same as if 
the eurrent density were uniform to a depth below the metal surface equal to the 
skin depth and zero below this. 

6. A wave guile is terminated by a thin conducting surface perpendicular to 
the axis of the guide. Find the resistance per square of this conductor (that is, 
the resistance neross a square of the material, from one side to the opposite side, 
which is independent of the size of the square), such that a wave falling on the 
surface will be totally absorbed, with no reflection. Find- the relation of this 
resistance per square to the quantity Z 0 of (2.4). 

7. A wave guide is filled with a dielectric giving a value of Zoflfrom (2.4)] for 
z < 0; another dielectric giving Z\ for 0 < z < L; and a third giving Zi for L < z. 
A wave travels in the direction of increasing z , and in addition there are reflected 
waves, in the direction of decreasing z , for z < 0, and for 0 < z < L. By con¬ 
sidering the boundary conditions at. the surfaces of separation, find the amplitude 
of the reflected wave for z < 0. Show that this amplitude is zero if L equals a 
quarter of a guide wave length (in the material filling the section 0 < z < L), and 

if Y>\ — 



CHAPTER XII 

SPHERICAL ELECTROMAGNETIC WAVES 


Suppose we have an electric charge oscillating sinusoidally with 
the time. This charge will send out a spherical electromagnetic wave, 
radiating in all directions. Several physical problems are connected 
with such a wave. First, the phenomenon may be on a large scale, as 
in a radio antenna. Radiation from a vertical antenna, as a matter 
of fact, can be treated approximately by replacing the antenna by 
such an oscillating charge. But also on a smaller scale we can treat 
the radiation of short electromagnetic waves, or in other words light, 
from an atom that contains oscillating electrons. The electrons may 
have been set in motion by heat or bombardment, in which case we 
have the treatment of the emission of light from a luminous body; or 
they may be in forced motion under the action of another light wave, 
as in the case of the scattering of light. As a first step in the discussion 
of these problems, we consider the solution of Maxwell’s equations in 
spherical coordinates. This solution, similar to the one we found in 
Chap. Ill for the static problem, will reduce in a simple case to the 
field of an oscillating dipole, which we require for the applications 
mentioned above. At the same time, it will give us the field of any 
oscillating multipole. Other forms of the same solution will lead to 
the field inside a spherical resonant cavity, or more generally in a 
cavity resonator bounded by surfaces r = constant, 0 — constant, and 
(p — constant, in a system of spherical polar coordinates. We pass 
then to the solution of Maxwell’s equations, later taking up their 
applications. 

1. Maxwell’s Equations in Spherical Coordinates.—In spherical 
polar coordinates, r, 0, <p, we can define a vector such as E in terms of 
its components E ri E&, E v , along the directions in which these coordi¬ 
nates increase. Then, using the methods of vector operations in 
curvilinear coordinates, which are discussed in Appendix TV, Maxwell’s 
equations for free space, in the absence of charge and current, become 


1 

r sin 0 
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( 1 . 1 ) 

The first three of these represent the equation curl E Hr mo d’H.jdt = 0, 
the fourth is div H = 0, the next three are curl H — « 0 dE/dt = 0, 
and the last is div E = 0, where as before the divergence equations 
contribute no new information not included in the others. The 
solutions of these equations represent, in general, waves propagated 
outward, or inward, along the radius. Thus the r direction can be 
considered the direction of propagation. As with the case of the 
wave guides considered in Chap. XI, there are two types of waves: 
transverse electric, or TE, in which the longitudinal component of the 
electric field, E r , is zero; and transverse magnetic, TM, in which 
H r is zero. We shall set up the solutions of both types, and consider 
their properties. 

The solution of the wave-guide problem was simplified a great 
deal because we could assume from the outset that each component 
of the field varied exponentially along the z axis, the direction of 
propagation. It cannot be assumed in a corresponding way here 
that the components vary exponentially with r; as a matter of fact, 
their dependence on r is given by certain Bessel’s functions, which 
approach an exponential form only for large values of r. Therefore 
we cannot at once carry through derivations like those of Eqs. (2.2), 
(2.3), and (2.4) of Chap. XI, expressing all the field components in 
the TE case algebraically in terms of Hz, and all the field components 
in the TM case in terms of E z . Nevertheless an essentially equivalent 
discussion can be carried through, except that here the relations are 
not simply algebraic but involve differentiation. 

Thus let us start with the TE case. We let H r be a scalar function 
of r, 6, <p . It is slightly more convenient to operate, not with H r , but 
with a quantity that we may denote u , defined by 

u — rH r . 
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It will appear that the equation that must be satisfied by u is the wave 
equation, which in spherical polar coordinates becomes 


I l. ( r * 

r 2 dr \ dr ) 
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1 d {„• 
r 2 sin e ae \ sm 
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1 dht 
r 2 sin 2 0 d<p 2 


1 dhi 
c 2 at 2 


= o. 


(1.3) 


We assume a sinusoidal dependence on time, varying according to 
the exponential e } ' wt . The equation can then be solved by separation 
of variables, just as in Sec. 2, Chap. Ill, where we discussed the cor¬ 
responding static problem. Letting u be of the form 


u — R(r)Q(6)$(ip)e iu>i , 

we find that the functions satisfy the equations 

dR 


ld(.dR\ p SL+i) 1ft- 
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1 d ( . do\ . f 7/7 I -j \ m 2 1 
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(1.5) 

(1-6) 

(1.7) 


similar to Eqs. (2.3), (2.4), and (2.1), of Chap. III. We shall discuss 
the solutions of these equations in the next section. Assuming that 
we have found u from them, we can now set up values for all the other 
field components. 

It is not easy to proceed straightforwardly to a solution of Max¬ 
well's equations. We find, however, the following equations for the 
field components in terms of u: 


Hr = 


u 
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By direct substitution in Maxwell 7 s equations, (1.1), we can show 
that these equations are satisfied if u obeys the wave equation (1.3), 
or its equivalent equations (1.5), (1.6), (1.7). Thus we prove that 
the functions (1.8) are the solutions of the problem which we desire. 
We note that, instead of defining He and H v directly in terms of u, we 
can instead set up the relations 


He = 


H 


<p 


lid 
jc^Ho r dr 


(.rEJ, 


(rE,). 

joifj .0 r dr 


(1.9) 


We shall soon see that these equations form the analogy to the rela¬ 
tion (2.4)- of Chap. XI, giving the relation between the transverse 
components of E and Hina wave guide. 

The relations for the transverse magnetic, TM , case are analogous 
to those just set up. In that case, we have a longitudinal component 
E r of electric field, which we write as 


Er 



( 1 . 10 ) 


where v satisfies the same wave equation (1.3) that u satisfied. In 
terms of v , we can then write the other components as 
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where we have the relations 


Ee = 
E v = 


-l i a 

jcoeo r dr 

li a 

jeoeo r dr 
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We now have our complete set of equations for determining the fields, 
and shall consider their solutions in the next section. 

2. Solutions of Maxwell’s Equations in Spherical Coordinates.— 
To find the components of a field in spherical coordinates, we have 
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seen that we must first find a scalar u or v, satisfying the wave equa¬ 
tion (1.3), which separates to give (1.5), (1.6), and (1.7). The depend¬ 
ence on angle, given in (1.6) and (1.7), is just as it is in the static case 
taken up in Chap. Ill, and as in that case we find that © = Pr(cos 0), 
^ ss sin w><p or cos 7 n<p. The equation for the function R, however, is 
more complicated than in the static case, though it reduces to it for 
co = 0. If we make the substitution R = f/Vr, where / is a function 
of r, we find that (1.5) is transformed to Bessel's equation for a Bessel 
function of order Z -j— Thus we have R = J i/2(w7’/c)/" v/t or 
Ni+i/2(o)r/c)/^/r. 

It is customary to define spherical Bessel and Neumann functions 
ji(x) and ni(x) by the equations 

ji(x) — ^^ J z+ 1 / 2 ( 37 )} = ^2x ■^ z ‘*' 1/2 (* c ). (2-1) 


If Z is an integer, which it will be in our applications, we can show, as 
is mentioned in Appendix VII, that ji and ni can be expressed in 
analytic form in terms of algebraic and trigonometric functions. For 
the first few functions we have 
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At large values of x, the terms in 1/x are the leading ones, 
general case, this leading term is given by 


ni(x ) 



Z + 1 
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In the opposite limit, as x tends to zero, we can expand in power 
series. In this case the leading term is 


Ji(x) 

ni(x) 
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(2.4) 


In terms of these spherical Bessel functions, it is clear that our func¬ 
tions u or v can be written in the form 


u or 


v = jy, or n 1 ^ J P™(cos 0)(sin m<p or cos rrup). (2.5) 
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The question of which functions to use in (2.5) depends on 
the boundary conditions to be satisfied, and this in turn depends on 
the problem that we are trying to solve. The simplest problem is the 
field within a cavity resonator bounded by a perfectly conducting 
spherical shell. In this case the field must be finite at the origin; 
therefore we can use only the functions ji, not the Neulnann functions 
ni. Furthermore, the electric vector may have only an r component 
at the value of r giving the radius of the sphere. This means, for a 
TE mode, as we see from (1.8), that ji(cor/c) must be zero at the 
radius of the sphere, and for a TM mode, from (1.11), that the r 
derivative of rji(a>r/c ) must be zero at this radius. These conditions 
easily determine the resonant frequencies of the various modes. We 
can almost as easily satisfy the boundary conditions for a resonator 
formed by the space between two concentric spheres. By making 
suitable combinations of ji(c»r/c) and ni(o>r/c), we see at once, from 
(2.2), that we can adjust the phase of the sinusoidal part of the func¬ 
tion to any desired value. We then may choose the phase correctly 
to satisfy the boundary condition on one of the surfaces, and determine 
the frequency to satisfy the conditions on the other surface. These 
problems have an obvious analogy to the two-dimensional problems 
of the modes within a circular cylinder, and a coaxial line, discussed in 
the preceding chapter. 

More interesting problems come when we take up the field in a 
region extending to infinity. In such cases, we are generally inter¬ 
ested, not in standing waves, as in the solutions we have set up so far, 
but in traveling waves, traveling either inward or outward along r. 
Thus for instance in a problem of the emission of radiation from an 
atom or a radio antenna, we wish only waves traveling outward. 
It is easy to combine the Bessel and Neumann functions to secure 
traveling waves^_The functions ji(z) ± jni(x) [where we are to dis¬ 

tinguish j = y/ — 1 from ji(x )] are called “spherical Hankel functions.” 
From (2.3) we see that at large distances they approach the values 

jl{x) ± jrii(x ) —»— (H-Dir/ 2 ]. (2.6) 

X 

Thus, if x is replaced by cor/c, we see that for the + sign this repre¬ 
sents the space part of a wave traveling in along r, and for the — sign 
a wave traveling out, in each case with an amplitude that falls off 
inversely as the distance, so that the intensity, which is proportional 
to the square of the amplitude, falls off inversely as the square of the 
distance. For radiation from a source, then, we choose the — sign 
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in (2.6), and thus secure only an outgoing wave, without an incoming 
wave. We note that the Hankel functions, containing the Neumann 
functions, all have singularities at the origin. Thus an outgoing 
wave must either be propagated from a finite charge distribution, 
within which, since the charge density is different from zero, Maxwell's 
equations in the form (1.1) do not apply; in this case, a separate 
solution for the field within the charge distribution must be found, 
and it must be joined to the solution we have found; or the outgoing 
wave must be produced by a singularity of field, which can take the 
form of certain point charges, dipoles, and other multipoles. We shall 
take up examples of such singularities in a later section. 

A very important form of problem is that of scattering of radiation 
by an object of some sort. If the object has spherical symmetry, as 
for instance if it is a small sphere of metal or of a dielectric, we can 
solve the problem of scattering by means of our spherical solutions 
of Maxwell's equations. For instance, let us consider scattering by a 
perfectly conducting sphere. Physically, we consider a plane wave 
incident on the sphere, and scattered spherical waves emerging from 
it. In this simple case, there is no mechanism for loss of energy, 
and all the energy scattered is made up from energy that strikes the 
sphere. We must make up the field by superposing solutions of the 
type we have considered. These solutions must satisfy boundary 
conditions of two sorts. First, at the surface of the sphere, the 
tangential components of electric field, and radial components of 
magnetic field, must be zero. To satisfy these, we must superpose 
Bessel and Neumann functions, in such a way as to make the function 
u equal to zero at the surface of the sphere for a TE mode, or the 
normal derivative of rv equal to zero for a TM mode. This can be 
done straightforwardly, and the result in general will not turn out to 
be a Hankel function, and hence will represent both an incoming and 
an outgoing spherical wave. 

In addition to the boundary condition at the surface of the reflecting 
sphere, we must satisfy another condition, which requires more 
thought. We can easily state it physically: the disturbance must 
consist entirely of an incoming plane wave, plus outgoing spherical 
waves. To apply this condition", we must know how to describe a 
plane wave in terms of Bessel and Neumann functions. This is a 
rather involved problem, but in the similar case of a scalar wave 
equation, such as we should find for the scattering of sound, where we 
need only consider the pressure as a function of position, it becomes 
fairly simple. Considering only a scalar solution of the wave equation. 
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one can prove the following theorem: 

oo 

~ * = y (2Z + l)(±j)*Pi(oos e)ii(*r). (2.7) 

Z-0 

Here Pz(cos 0) is the value which P 7 (cos 0) takes on when m — 0. 
We note that r cos 6 = z, so that, if we set A- = <o/c, the function 
on the left of (2.7) represents the space-dependent function which, 
combined with a time exponential e 3 ' ut , would represent a plane wave 
traveling along the + z direction. We thus see how to expand such a 
plane wave in terms of spherical Bessel functions. It is natural that 
no Neumann functions come into this expansion, for they would intro¬ 
duce a singularity at the origin, which the plane wave does not have. 

To satisfy our requirements, we must then build up a solution 
of our problem, consisting of the sum of the plane wave (2.7), and 
only outgoing spherical waves. That is, the term of the solution 
associated with a given l value, and with m = 0 (for, as we easily see, 
this problem will involve only terms for m = 0), will have the form 

(21 4- l)(±j)Pz(cos 6){ji(kr) + on[ji(kr) — jni(kr)]} (2.8) 

where cn is a coefficient measuring the amplitude of the Zth scattered- 
wave component. Thus we see that the ratio of the coefficients of 
ji(kr) and ?h(kr) must be (on 4- 1 )/(—jon). On the other hand, this 
ratio has been determined quite independently from the condition that 
the solution satisfy the boundary condition at the surface of the 
reflecting sphere. Thus, by equating these two values of the ratio 
of coefficients, we can evaluate the ais, and find the intensity of the 
scattered radiation. 

This procedure in principle is simple, but in practice a complete 
discussion of the problem is beyond the scope of this text. We recall 
that our field is really a vector field, so that instead of (2.7) we must 
use a corresponding vector formulation of a plane wave in terms of 
spherical waves. When we do this, and carry through the discussion, 
we find the following sort of result: The behavior of the scattered 
radiation depends a great deal on the ratio of the diameter of the 
scattering sphere to the wave length. For a sphere very small 
compared with the wave length, the sphere acts almost as if it were 
in a uniform field. It then acquires a dipole moment, as we have 
shown in Chap. Ill, Sec. 4. This dipole alone scatters radiation, and 
we shall give an elementary discussion of this case in a later section 
of the present chapter, showing the type of radiation emitted from a 
dipole. As the sphere gets larger, it scatters a more and more compli- 
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cated field, consisting now not only of dipole radiation, but of the 
fields of higher multipoles, which are discussed in Appendix VI. As 
the sphere becomes very large compared with a wave length, the 
outgoing wave takes on less and less of the character of a scattered 
wave, and approaches a reflected wave as predicted by geometrical 
optics for the simple optical problem of the reflection of a plane wave 
by a spherical mirror. In all these cases, we must consider not merely 
that part of the spherical wave which travels out in all directions, but 
in particular that part which travels in almost the same direction 
as the plane wave, after it has passed over the scattering obstacle. 
In this case, the scattered and incident waves interfere with each other, 
and it can be shown that they interfere destructively, so as to remove 
energy from the incident wave, just enough to supply the energy 
radiated out in the scattered wave. If the obstacle is small compared 
with the wave length, this interference fills a poorly defined region 
behind the obstacle, whose dimensions are determined by interference 
theory, but as the obstacle becomes large, the region of destructive 
interference becomes sharply defined, and becomes the ordinary 
shadow of geometrical optics. Around the edge of this shadow, how¬ 
ever, there are diffraction fringes, alternations of intensity, resulting 
directly from the interference phenomenon. 

All this interference and diffraction comes as part of a complete 
discussion of the problem of scattering of a plane wave by a sphere, 
and makes it clear why a complete treatment of the problem is beyond 
the scope of the present book. However, we shall handle interference 
and diffraction in an elementary way in a later chapter, bringing out 
many of the features that are present in the exact solution. The type 
of discussion sketched in the present section is important not only in 
electromagnetic theory, but in all other forms of wave theory as well. 
A simple example is the scattering of sound, as treated for example 
by P.M. Morse, Vibration and Sound, 2d ed. (McGraw-Hill Book 
Company, Inc., New York). More far-reaching is the application 
to quantum mechanics. The problem of scattering of an electron, 
neutron, or other colliding particle by an atom or nucleus can be 
handled to a first approximation, in wave mechanics, by a problem of i 
scattering in a spherically symmetrical region where the index of 
refraction is a function of r. The mathematical framework of this 
problem is identical with that which we have sketched here, and a 
thorough understanding of the solution of the wave equation in spher¬ 
ical coordinates is an essential for the understanding of any type of 
scattering of waves by small obstacles. 
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3. The Field of an Oscillating Dipole.—In many ways the most 
important spherical wave physically is that produced by an oscillating 
electric dipole. This is the simplest TM wave. If we consider our 
solution (1.11), we see that the function v must depend on $ or in 
order that any of the components of field may be different from zero. 
Thus we cannot have l = 0, since the spherical harmonic Po(cos 0) 
is a constant and does not depend on angles at all. The lowest value 
that l may have is 1. In this case, m can be 0 or 1. We consider the 
case l = 1, m = 0. This proves to be the simple dipole field. If we 
choose v to be 

v — cos d[ji(kr) — jni(kr)], 
or 


v = cos 9 



(3.1) 


where k = ca/c , which is equivalent to it according to (2.2), we find 
for the field components 
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The other field components are zero. These functions, of course, are 
to be multiplied by an arbitrary amplitude. 

To understand the meaning of our solution, let us first consider 
the terms that are important at small distances, those in the highest 
inverse powers of r (1/r 3 for E r and E & , 1/r 2 for H v ). We shall show 
that the terms in the electric force represent the field of an electric 
dipole at the origin and that the magnetic field is the field of the 
corresponding current clement, derived from the time rate of change 
of the dipole moment, as found from the Biot-Savart law. We 
remember that a dipole of moment M has a potential 


, (M cos 0) 

* -w 5- ' ’ 

as we saw in Eq. (5.1), Chap. III. ,The field is then 
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We observe tli&t these are just a constant, ^7r 2 ikf/27reo, times the 
corresponding terms in (3.2). Similarly we note that a dipole whose 
moment was Me iwt would carry with it a current element equal to its 
time derivative, or to jooMe jut . This small current element would 
have a magnetic field H<p, which would be given according to the 
Biot-Savart law, as described in Sec. 1, Chap. V, by 


rr J<* M sin Q 
“ 4tt 6 r 2 


(3.4) 


This is likewise —jk 2 M/2ire 0 times the value from (3.2), at small 
distances. It is clear, then, that if we multiply (3.2) by this factor, 
we shall find a field that is a correct solution of Maxwell’s equations 
at all distances, and yet that reduces to the field of a dipole of moment 
]Me la,t at small dis tances, so that it must represent correctly the field 
of such a dipole. The field is 
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4. The Field of a Dipole at Large Distances.—We have seen that 
at sufficiently small distances the field of an oscillating dipole reduces 
to what we should compute by Coulomb’s law and the Biot-Savart 
law. As kr becomes appreciable compared with unity (that is, as 
2 ttt/X beco m es appreciable, or as v approaches l/2ir wave lengths) the 
remaining terms become important. This is an example of a general 
situation i at distances from an oscillating charge distribution that 
are small compared with wave length, we can use Coulomb’s law and 
the Biot-Savart law. On the other hand, for distances large compared 
with a wave length, the situation is entirely reversed, and the first 
term in each expression in (3.5) becomes the important one. The 
leading terms in Eq and H v vary as 1 /r, while the leading term of J5J r 
goes as 1/r 2 , and therefore can be neglected at sufficiently large dis¬ 
tances. The field, in other words, becomes transverse at large dis¬ 
tances, with an amplitude inversely proportional to the distance. In 
this range it is ordinarily referred to as the “radiation field.” We see 
that E and H become at right angles to each othe r, and check further 
that the ratio of Eg to H v approaches the value •%/mo/ eo, characteristic 
of plane waves in free space. 
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The intensity varies as sin 2 9, so that the ma.viTrmm radiation is 
at right angles to the axis of the dipole. To measure the intensity, we 
may compute the average value of Poynting’s vector. We have 


S r « ^AReEeS v 


2 R ° 


sin 2 9 


16x 2 €o " L(/rr) 2 

Mo a/como to 4 _M 2 sin 2 9 
327T 2 7' S 


lj_ i _] 

L(A-r) 2 (Ar) 6 J 


(4.1) 


We see that the intensity of radiation varies as 1/V 2 , the inverse- 
square law, as of course it must if the total flux outward through 
a sphere is independent of the size of the sphere. To find this total 
flux, we must integrate over all directions, by multiplying by the 
element of area 2xr 2 sin 9 dd and integrating from 0 to x. We then 
have 

J Sda = a. J’ a j n , ed$ 

Mo aAoMo co 4 M 2 /jt 

-IS- < 4 - 2 > 

This is a well-known formula for the radiation from a dipole. The 
two essential features are that the radiation is proportional to the 
square of the amplitude of the dipole, and to the fourth power of 
the frequency. It can be used for such diverse problems as finding 
the rate of radiation from an excited atom, or from a radio antenna, 
if the latter can be approximated, as it often can, as an oscillating 
dipole. 

5. Scattering of Light.—In Sec. 3 we have given an indication 
of the theory of the scattering of light. We can take up easily a 
special and important case of scattering. Let a plane wave fall on 
the type of dipole that we have assumed in Chaps. IV and IX as 
being responsible for the dispersion and dielectric properties of matter. 
We assume the dipole to be small in dimensions compared with the 
wave length, so that the electric field acting on the dipole may be 
taken to be E 0 e iut , disregarding the variations with position. Let the 
dipole have an equation of motion 


d 2 x . 

m -77x 4- 


dt 


2 


mg 


dx 

dt 


+ moilx = eE 0 e iui , 


(5.1) 


as in Eq. (1.2), Chap. IX. Then the dipole moment will be 
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e 2 E 0 e’ ut 
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m cog — co 2 4- jug 


(5.2) 


This is the oscillating dipole moment produced by the field. 

The dipoles set into motion by the wave will emit light, which is 
scattered. The rate of emission by a single dipole is found by sub¬ 
stituting the absolute value of the dipole moment in (5.2) into (4.2). 
Often the scattering is measured by the scattering cross section cr. 
This by definition is the area on which enough energy falls from the 
plane wave to equal the scattered intensity. Thus the rate of emission 
must equal <r times the Poynting vector of the plane wave, which is 
aAo/mo Eq. Making the substitutions, we then find 


32tt __/ e 2 V 

3 [(cog — CO 2 ) 2 + (co<7) 2 ] \87reo?nc 2 / 


The quantity (e 2 /&n- e 0 mc 2 ) is the quantity that we found, in Sec. 4, 
Chap. VIII, to be a classical radius R of the electron of charge e. 
Thus we find the interesting result that dimensionally the scattering 
cross section is determined by the square of this classical radius, 
or by the classical cross section of the electron, though with a numerical 
factor, and with a factor depending on frequency, which behaves quite 
differently in different parts of the spectrum. We shall now consider 
three important special cases of this scattering formula, holding for 
different frequencies: 

a. The Rayleigh Scattering Formula .—This is what we have in 
the case where <o is small compared with co 0 . Since for ordinary 
atoms o )o is a frequency in the ultraviolet, we have this condition 
in the visible range of the spectrum. Then (5.3) becomes 

' = 3 -t R1 fc ) 4 '' (5 - 4 > 

The scattering is here proportional to co 4 , or to 1/X 4 , where X is the 
Wave length. This is the Rayleigh scattering formula, developed to 
discuss the scattering of light by the sky. The proportionality to the 
inverse fourth power of the wave length means that the short blue 
and violet waves will be scattered by the air molecules much more 
than the long red ones, resulting in the blue color of the scattered light 
from the sky. The transmitted light thus has the blue removed and 
looks red, explaining the color near the sun at sunset. 

b. The Thomson Scattering Formula .—In the other limiting case 
of X rays, when the frequency is large compared with w 0 , the scattering 
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becomes 



the Thomson scattering formula. This formula gives a scattering 
independent of the wave length, and is very important in discussing 
X-ray scattering by substances. 

c. Resonant Scattering .—If co is nearly equal to co 0 , it is evident that 
the denominator can become very small, resulting in very large 
scattering. This phenomenon can be much more conspicuous than 
the other two cases. Thus a bulb filled with sodium vapor, which 
has a natural frequency in the visible region, illuminated with light 
of this color, will scatter so much light that it appears luminous. 
This phenomenon is called “resonance scattering.” Here, as with 
the absorption lines discussed in Chap. IX, we have to correlate our 
classical theory of electric dipoles with the quantum theory. Actual 
atoms scatter, as they absorb, at frequencies connected with quantum 
transitions between energy levels. However, the same description 
of the dipoles that is correct for describing absorption and dispersion 
is also applicable to scattering. 

One observation regarding scattering is that, if the incident light 
is plane polarized, the dipoles will all vibrate along the direction of 
its electric vector. Thus there will be no intensity in the scattered 
light along this direction. The scattered light will have a maximum 
intensity at right angles, and it will be plane polarized. It was by 
experiments based on these facts that the polarization of X rays was 
first found. 

6. Coherence and Incoherence of Light.—In the preceding para¬ 
graphs, we have calculated the scattering from a single dipole or a single 
atom, when a plane wave falls on it. It is found in practice that, 
in a gas containing N molecules per unit volume, the total intensity 
scattered by unit volume will be just N times that scattered by a 
single molecule. On the other hand, if the molecules are regularly 
arranged in a crystalline solid, there is practically no scattering. 
The effect rather is the dispersion and absorption discussed in Chap. 
IX. We must give closer consideration to the question of why this 
distinction exists, and why in particular the intensity of the scattered 
radiation from N molecules of a gas is N times that from a single 
molecule. Since the Maxwell equations are linear, the field vectors 
E and H satisfy the superposition principle, so that we should expect 
the total amplitude to be the sum of the amplitudes in the various 
waves, in which case the total intensity, being the square of the ampli- 
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tilde, would certainly not be the sum of the separate intensities. The 
key to this situation is found in the relations between the phases 
of the various waves that we are adding: if they all have fixed phases 
relative to each other, they are said to be "coherent,” and the ampli¬ 
tudes add; whereas if they are in phases having random relations to 
each other they are "incoherent,” and the intensities add. 

To be more precise, let us consider the sum of a number of sinu¬ 
soidal waves, all of the same frequency, but of different amplitude 
and phase: 


^ -dfc cos {oit — a.k) = cos akj cos cot + Ak sin 

h k k 


sin cat. 


If all the phases should be the same, say a* = 0, then the amplitudes 
of the cosine and sine terms will be ^ A k and 0, respectively, so that 

k 

the amplitudes add, and the intensity is proportional to (2 A^, or, 

k 

if for instance there are N terms of equal amplitude, proportional to 
N 2 times the intensity of a single wave. On the other hand, the a ’s 
may be completely independent of each other, meaning that each a is 
equally likely to have any value between 0 and 2tt, independent of the 

others. Then we can see that ^ A k cos cx k will be far less than ^ A k , 

k k 

since we shall have just about as many terms with positive values of 
cos a k as with negative, and the terms will just about cancel. 

The cancellation will not be complete, however, as we see if we 
compute the squares of the summations, which we must add to get the 
intensity. The square of the first summation, for instance, is 

(X ^* C0S aAj ) = X C ° s2 a * “t" X AkAi cos Oi k cos a t . 




We must find the average of this, taking the a's as independent. 
That is, we must perform the operation of integrating each a from 
0 to 2 tt, and dividing by 2 tt. When we do this, the terms cos 2 a h 
average to 3^, while the products of two independent a’s average to 

zero, leaving ^ ^ A\. The other summation gives an equal term, 

k 

so that we find that the mean-square amplitude, or mean intensity, 
averaged over phases, is the sum of the individual intensities. This 
is the state of complete incoherence, in which for N waves the intensity 
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is N times the intensity of a single wave, rather than N 2 as for the 
coherent case. The cancellation of waves, then, while not complete, 
is more and more perfect as N increases, for AT becomes a smaller and 
smaller fraction of N 2 as N increases. 

We can now apply the idea of coherence to the scattering of light 
from a gas. The phase of the wave at a point P, scattered by an 
atom at a, in Fig. 27, depends on the total path the light has traveled 
from the source to a, and from a to P. Since the molecules of a gas 
have no fixed positions with respect to each other, these paths are in a 
random relation to each other, the phases are incoherent, and we 
are justified in adding intensities. Such a procedure would not be 
allowed for example in discussing the scattering of X rays by crystals, 



where the various atoms are in fixed lattice positions. Indeed, here 
we do get interference, and it is just by studying the interference 
patterns so obtained that we obtain our information about the lattice 
structure of crystals. Neither would the procedure be allowed in 
discussing the scattering from a gas in the same direction as the inci¬ 
dent radiation, as in ( b ), Fig. 27. For then the paths of the beams 
scattered from the various atoms are approximately equal, the waves 
are in phase, and they produce a resultant field at P proportional 
to the amplitude, rather than the intensity, of the incident wave. 
This scattered field can be shown to interfere with the incident wave 
in such a way that the resultant produces the refracted wave. The 
close relation of our scattering formulas to the formulas for the index 
of refraction, therefore, becomes clear, and it is evident that our two 
problems of refraction and scattering, though we have treated them 
separately, are really parts of the same subject. Scattering straight 
ahead produces refraction, and does not depend on exact placing of the 
molecules. Scattering to the sides, on the other hand, does not occur 
unless the molecules have a random arrangement; then the intensity, 
not the amplitude, is proportional to the number of molecules. 

We have just been considering the coherence of scattered light. 
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There is another aspect of coherence, related to this, which can be 
discussed in a similar way. This is the incoherence between differ¬ 
ent independent sources of light, even when they have the same 
frequency. It is based on the relation between coherence and the 
spectrum. The amplitude of a wave of light, as a function of time, 
is never exactly sinusoidal, but is really a much more complicated 
function. It is often desirable, however, to resolve such a function 
into a spectrum; that is, write it as a sum of sinusoidal waves of 
different frequency. This can be conveniently done by Fourier series, 
as is discussed in Appendix III. 

To do this, we take a Fourier series with an extremely long period 
T, so long that all the phenomena we are interested in take place in a 
time short compared with T, so that we are not bothered by the 
periodicity of the series. Then, if our function is f(t), we have 


fit) = ^ (A n cos a> n t + Bn sin co n t), 


where 


n 


__ 2 f T/2 
T J - T/2 


fit ) cos cc n t dt, B f 


-U 


T/2 


T/2 


fit) sin c o n t dt, 


00 n — 


2 im 

T ‘ 


This gives an analysis into an infinite number of sine waves, with 
frequencies spaced very close together (on account of the very small 
size of 2i t/T). No actual, physical wave is then perfectly sinusoidal, 
in the sense of having but one term in this expansion with an amplitude 
different from zero. We shall show in a problem that even a perfectly 
sinusoidal wave that persists for only a finite length of time will have 
appreciable amplitudes for all those frequencies within a range Aco, 
equal in order of magnitude to the reciprocal of the time during which 
the wave persists, so that a sine wave of long lifetime will correspond 
to a sharp line in the spectrum, while a rapidly interrupted wave will 
give a broad line. This is observed experimentally in the fact that 
increasing the pressure of a gas, thereby making collisions more 
frequent and interrupting the radiating of the atoms, broadens the 
spectral lines. 

The intensity is proportional to pit), or to the square of the summa¬ 
tion over frequencies. Just as before, this square consists of terms like 

cos 2 c*3 n t, and cross terms like A n A m cos a n t cos 03 m t. Instantaneously 
none of these terms are necessarily zero. But if we average over 
time, the terms of the first sort average to A 2 /2, while those of the 
second sort average to zero. The final result, then, is that the time 
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average intensity is the sum of the intensities of the various fre¬ 
quencies:/ 5 ^ = }/2 ^ ( Ai + Bl). We are justified in considering the 

n 

terms connected with a given n to be the intensity of light of that 
particular frequency in the spectrum, so that we have the theoretical 
method of determining the spectral analysis of any disturbance. And 
we see that the following statement is true: on a time average, sinu¬ 
soidal waves of different frequencies are always incoherent, and never 
interfere. 

It is known experimentally that light from two different sources 
never interferes; to get interference we must take light from a single 
source, split it into two beams, and allow these beams to recombine. 
If we regarded the sources as being monochromatic, it would be hard 
to see why this should be, for the amplitudes of two waves of the same 
frequency should add, rather than the intensities, and this is the essence 
of interference. But when we observe that each source really is 
represented by a Fourier series, the situation becomes plain. For 
two sources are always so different that their Fourier series will be 
entirely different. If we analyze both of them, the phase of the radia¬ 
tion of frequency w« from one will be entirely independent of the phase 
of the corresponding frequency from the other. Thus if we add, 
square, and average over this random relation between the phases 
of the two sources, the cross terms will cancel, and the intensities 
add. The randomness comes in this case, not in adding a great many 
terms of the same frequency, but in combining the terms of different 
frequencies, which are related in entirely independent ways in the two 
sources. 

Problems 

1. Discuss the weakening of sunlight because of scattering, as the light passes 
through the atmosphere. Assume that the molecules of the atmosphere have a 
natural frequency at 1,800 A (whore absorption is observed). Let each molecule 
contain an electron of this frequency. Assume that the number of molecules is 
such as to give the normal barometric pressure. Find the fractional weakening of 
a beam duo to scattering in passing through a sheet of thickness ds , and from this 
set up the differential equation for intensity as a function of the distance. Solve 
for the ratio of intensity to the intensity before striking the atmosphere, for the 
sun shining straight down, and for it shining at an angle of incidence of 60°. Con¬ 
stants: e — 1.60 X 10 -10 coulomb, m — 9.1 X 10“ 31 kg, number of molecules in 
1 gm-mol = 6.Q3 x 10 23 . 

2. A vibrating dipole radiates energy, and therefore its own energy decreases. 
Noting that the rate of radiation is proportional to the energy, set up the differen¬ 
tial equation for the energy of the dipole as a function of the time. Find how long 
it takes the dipole to lose half its energy. Work out numerical values for the 
sort of dipole considered in Prob. 1. 
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3 . Using the results of Prob. 2, find the equivalent damping term that would 
make the dipole lose energy at the same rate as the radiation. This damping is 
called the “radiation resistance.” 

4. Suppose we have an alternating current of maximum value J, in a vertical 

antenna of length L. Treating this as a dipole, find the total radiation. Show 
that the equivalent resistance necessary to produce the same power loss (the 
radiation resistance) is _ 

R =~ V- 7? ohms - 
3 * €0 A * 5 


5. Show that the field of an oscillating dipole of moment Me> ut , pointing 
along the axis of the spherical coordinates, can be derived from the following scalar 
and vector potentials: 


t - 


M d 


4:1T€q dr 


t pju ({ T/c) "1 

e - 1 —\ c °s 
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juti 0 Me’ u ^ r / e > 


4irr 


cos 0 


Ae -s « — 0111 

Ap = 0 . 

6- The so-called “Hertz vector” arising from a dipole of moment at a 

distance r from the dipole, is equal to 

^ e ju>U-r/c) 


II = 


4-7r€t\r 


and points in the direction of the dipole moment. Show that in terms of the 
Hertz vector the potentials can be found from the expressions 

A = como t = - div II. 

at 


7 . Show from the Hertz vector that the field at large distances from a dipole 

is given by ___ 

E - { r x [ r x £ 

H - -i[ rX ^ Mei "‘" A> ]' 

8 . Find the spectrum of a disturbance that is zero up to t — 0, is sinusoidal 
until t = To, then is zero permanently. (Hint: Make the period T of the Fourier 
series indefinitely large compared with To.) 

9. Find the spectrum of a disturbance that starts at t = 0, and is a sinusoidal 
damped wave after, that. Show that the curve for intensity as a function of 
frequency has the same form as a resonance curve, in general, and that its breadth 
is connected with the damping constant in the same way. This illustrates an 
important principle: the emission and absorption spectra of the same substance 
are essentially equivalent. The resonance curve represents the absorption curve, 
because of the relation of forced oscillators and dispersion, whereas the damped 
wave is the emission. (Hint: Make the period T indefinitely large compared with 
the time taken for the oscillation to 1/eth of its value.) 



CHAPTER XIII 

HUYGENS* PRINCIPLE AND GREEN’S THEOREM 


Huygens* principle provides a well-known elementary method for 
treating the propagation of waves, and in this chapter we shall con¬ 
sider its mathematical background, showing its close connection with 
Green’s theorem. The method is this: From each point of a given 
wave front, at t = 0, we assume that spherical wavelets originate. 
At time t , each wavelet will have a radius ct, and the envelope of these 
wavelets will form a new surface, which according to Huygens is 
simply the resulting wave front at this later time t. Thus, if the 
original wave front was a plane, it is easy to see that the final one 
will be a plane distant by the amount ct, while if it is a sphere, the final 
wave front will be a concentric sphere whose radius is larger by ct. 
In either case this construction gives us the correct answer, agreeing 
with the more usual methods of computation. The one difficulty 
is that our construction would give a wave traveling backward, as 
well as one traveling forward; the solution of this difficulty appears 
when we use the methods of the present chapter. 

We may look at our process in a slightly different way, not used 
by Huygens, but developed early in the nineteenth century when the 
interference of light was being worked out by Young and Fresnel. 
Suppose that, instead of taking the envelope of all the spherical wave¬ 
lets, we consider that each of these wavelets has a certain amplitude, 
consisting of a sinusoidal vibration. We then add these vibrations, 
just as if the wavelets were being sent out by interfering sources of 
light, and the resulting amplitude is taken to be that in the actual 
wave. This process can be shown to lead to essentially the same result, 
and it is this which can be justified theoretically. As a further gen¬ 
eralization, it is not necessary to take the original surface to be a wave 
front; it can be any closed surface, so long as we allow the scattered 
wavelets to have the suitable phase and amplitude. 

Our final result, then, is this: The disturbance at a point P of a 
wave field may be obtained by taking an arbitrary closed surface, and 
performing an integration over this surface. The contribution of a 
small element of area da of this surface equals the amplitude at P of a 
spherical wave starting from da at such a time that it reaches P at 
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time t. The simplest form of spherical wave is one we did not men¬ 
tion in Chap. XII. It is a scalar solution of the wave equation, 
independent of angles, and depending only on r. The wave equation 
(1.3) of Chap. XII, in a case where u depends only on r, may be written 

d 2 (ru ) 1 d 2 (ru ) _ „ 

dr 2 c 2 ~HT 2 ’ 


which has solutions u — — C - 

T 


} where / is an arbitrary function. 


If we have a simple spherical wave of this type, we shall choose the 
— sign, so as to represent a wave traveling out along the radius with 
velocity c, and of amplitude varying inversely as the distance. Now 
the contribution of an element da must surely be proportional to the 
disturbance at do, which we may call / (a function of time and posi¬ 
tion), and to da. Hence we have something like / da for 

the final result. We are thus led to a formula of this sort: 


/ (at a point P) = constant X J —— — da, 

where the surface integral is over a surface surrounding P. This sug¬ 
gests the solution of Laplace’s equation by Green’s method, where 
we obtained the value of a function (p at an interior point of a region 
where VV was zero as a surface integral over the boundary. As a 
matter of fact, an analogue to Green’s theorem is the correct statement 
of Huygens’ principle, and replaces the formula that we derived 
intuitively above, and that is not just correct. We shall now proceed 
to set up this theorem, first introducing the retarded potentials, which 
are closely related to the problem. 

1. The Retarded Potentials.—In Chap. VII, we introduced scalar 
and vector potentials, <p and A, giving the electric and magnetic fields 
by the relations 

E = — grad <p - ~ 

B — curl A. 


For these potentials, in empty space, we found the equations 
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c 2 dt 2 
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or d’Alembert's equation. We first ask how to get a solution of 
d’Alembert’s equation analogous to the simple solution 



of Poisson’s equation. We shall not carry through the proof of the 
solution, for that is rather complicated. But the essence of the solu¬ 
tion of Poisson’s equation is that we divide up all space into volume 
elements dv , and that p dv/AK^r is the potential of the point charge 
p dv at a distance r. This potential, of course, is a solution of Laplace’s 
equation, as is 1/r, at all points except for r = 0, where the charge is 
located. 

In a similar way, to solve d’Alembert’s equation, we divide up 
our charge into small elements, and write the potential as the sum 
of the separate potentials of these small charges. The separate poten¬ 
tials must now be, except at r = 0, solutions of the wave equation. 
This means that, since any change of the charge will be propagated 
outward with the velocity c, the potential at a given point of space 
resulting from a particular charge cannot be derived from the instan¬ 
taneous value of the charge, but must be determined, instead, by 
what the charge was doing at a previous instant, earlier by the time 
r/c required for the light to travel out from the charge to the point 
we are interested in. In other words, if p(x,y,z,t) is the charge density 
at x, y, z at the time t, and r is the distance from x, y, z to x ', y f , z ' 
where we are finding the field, we shall expect the potential of the 
charge in dv to be 

p(a;,y,g,l — r/c) do 
r 


and for the whole potential we shall have 

_ 1 f p(x,y,z,t — r/c) dv 

^ 47T€o J r 

= _ _L f [W - (i/c 2 )(awa* 2 )W c dv 

4ir J r 


(1.3) 


This solution is, as a matter of fact, correct. We have already 


seen that — 


T / 

is a solution of the wave equation, where / is 


any function, so that the integrand actually satisfies the wave equa¬ 
tion, as in the earlier case 1/r satisfied Laplace’s equation. The 
potential <p determined by this equation is called a “retarded poten- 
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tial,” since any change in the charge is not instantaneously observable 
in the potential at a distant point, but its effect is retarded because 
of the finite velocity of light. The solution for each component of 
the vector potential is determined in an analogous manner. 

2. Mathematical Formulation of Huygens* Principle. —In discuss¬ 
ing the application of Green’s theorem to the solution of Poisson’s 
equation in a finite region of space, we proved in Sec. 6, Chap. Ill, that 


<P 


--f 

4 x J 


VV 


dv 


~ S / V grad r ' a ~ ? grad v ' n ) 


da, (2.1) 


the result of the last paragraph being the special case where the region 
of integration is infinite and the surface integral drops out. We now 
wish to find an analogous theorem for use with d’Alembert’s equation. 
Here again we shall not give a real derivation, for this is complicated, 
but shall merely describe the formula that results, and show that it is 
plausible. We have already discussed the volume integral. In the 
surface integral, the first term gave the potential of a double layer of 
strength proportional to <p/Atc, the second the potential of a surface 
charge of magnitude proportional to 

(i) grad <p • n, or (~) (ff)- 

Each of the terms, <p grad (1 /r) • n, or <p[d(l/r)/dn] and (1 /r)(d<p/dn) 
is a solution of Laplace’s equation since it represents the potential of 
certain charges. 

In our case of the wave equation, the formula has two correspond¬ 
ing terms: one giving the potential of a double layer, the other of a 
surface charge. But now the charges change with time, so that we 
must use solutions of the wave equation in the integral. We have 
already seen that the solution of the wave equation corresponding to 
1 Jr is /(£ — r/c)/r; hence we expect the second term to be replaced 
by —(l/r)(d<p/dn) t -r/c, where this means that the partial derivative, 
which is now a function of time as well as of position on the surface, 
is to be computed, not at i, but at t — (r/c). Similarly corresponding 
to 3(1 /r)/dn, the difference of the potentials of two equal and opposite 
point charges at neighboring points of space, we have 

A [" /(* — lAO 

dn j_ r 

Remembering that in differentiating with respect to n we must regard 
r as a variable each time it occurs, this is 
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= _ cos (w,r) [~ /(< - r/e) , 1 a/(f - r/c) ] 


L' 

where in the last term we have used the relation 


J 


& fit — r/g) 
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da. (2.2) 


r/c ) d(£ — r/c) 
d(< — r/c) 

= - r/c) / 1 aA 

\ c aw/ 

= - - cos (n r) ggl r / c >. 
c ' J dt 

We should therefore expect to have 

T _J. f [w - (i/c»)(9Vtf)u . 

V 4n- J F-— cfo 

+ 4rf'r {[/ («),_ r/r + ^ ~ ] cos M 

+ 

This, as a matter of fact, is the correct formula. The first term 
represents the potential due to all the charge within the volume; if 
there are no sources of radiation within this volume, the volume 
integiaMs then zero, and that is the usual case with optical applica¬ 
tions. The surface integral represents the remaining potential as 
arising from a distribution of charge and double distribution about the 
surface, each surface element sending out a wavelet that on closer 
examination proves to be the Huygens' wavelet we are interested in. 
Thus, star ting from Oreen's theorem and d'Alembert's equation, we 
have^ arrived at a mathematical formulation of Huygens' theorem. 

T. o give a suggestion o± the rigorous proof of this formula, we could 
Proceed as follows: First, we notice that <p defined by this integral 
satisfies the wave equation; for since each term of the integrand 
separately is a solution, the sum must also be. Now it follows from 
this, although we have not proved it, that if the solution reduces to 
the conect boundary values at all points of the boundary, the solution 
must be the correct one, the reason being essentially that the boundary 
values determine a solution uniquely, so that, if we have one solution 
of the equation with the right boundary values, it must be the only 
correct solution. We must then show that the <p defined by the 
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integral actually has the correct boundary values. This could be 
done by a more careful treatment, and we should then have a demon¬ 
stration of the formula. The more conventional proof, however, is a 
fairly direct though complicated application of Green's theorem. 

We shall now take our general formula (2.2), and apply it to the 
cases we meet in optics, showing that it reduces to something like 
the formula that we derived earlier intuitively. We suppose that 
light is emitted by a point source, and that the value of some quantity 
connected with it, and satisfying the wave equation (one of the com¬ 
ponents of the fields or potentials—they all satisfy the same relations) 

«(f—ri/c) 

has the form-> where r 1 is the distance from the source to the 

r i 

point where we wish to find the disturbance. Then we wish to get 
the disturbance at P, not by direct calculation, but by using Huygens' 
principle. Suppose we take a closed surface. This surface can sur¬ 
round either the source, or the point P where we wish the disturbance. 
In any case, we define n as the normal pointing out of the part of the 

ri/c) 

space in which P is located. At a point of the surface, <p = - > 

where r\ is the distance from the source to the point on the surface. 
We then have, if r is the distance from P to a point on the surface, 


'(■ - o - 


(r+ri)/c] 

Vi 


d(p{t — r/c ) _ jcoAe J ' wlt ~ ( ' r+ri)/ei 
dt ri 

d(p{t — r/c) 


dn 


= —A cos 


(n,ri) (I + &) 

Vl c / 7*1 


—(r-H’O /c] 


Thus finally, substituting in (2.2) and rearranging terms, 
A 




7Ti 


0]u(C—(r+ri)/e] 


[(r + t) COS 

— ^ cos (n,ri)^j da. 


(2-3) 


In this formula, as in Chap. XII, we have two sorts of terms, some 
significant at small, others at large values of r and r\. Wc easily 
see that, if r and ri are large compared with a wave length, as is always 
the case in optics, the only terms we need retain are those in jw/c. 
Hence to this approximation 

<P — J e /«[«-(w-rx)/c][ cos — cos (n,ri)] da. 


(2.4) 
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This final form suggests our earlier, intuitive formulation of 

Huygens' principle. The incident amplitude at da is n ow 

ri 

we set up, starting from da , a wavelet whose amplitude is this value, 
retarded by the amount r/c, divided by r, and multiplied by the factor 
0’/2X)[cos (w,r) — cos (n,n)] da. This is just what we should expect, 
except for the last factor. The term j introduces a change of phase 
of 90°, not present in Huygens' form of the principle, but necessary. 
The term cos (n,r) — cos (n,ri) makes the wavelets have an amplitude 
that depends on angle. When r and r x are in opposite directions, which 
is the case when the surface is between the source and P, the factor 
approaches 2; whereas, when r and r x are parallel, and the surface is 
beyond P, it becomes zero. This means that the wavelets do not 
travel backward, thus removing the difficulty noticed earlier in 
Huygens' method. The wavelets have an amplitude depending on 
their wave length, decreasing for the longer wave lengths. 

3. Integration for a Spherical Surface by Fresnel’s Zones. —Let 
us now carry out our integration, and verify Huygens' method, in a 
simple case. We take the surface to be a sphere, surrounding the 
source, and therefore a wave front. We note that n is the inner 
normal of the sphere. Thus r 1 is constant all over the sphere, and 
cos (n,ri) = — 1 at all points, so that the formula simplifies to 

jAc^-"'* f er** r , x , ,, „ 

<P = -2Xr^- J ~T~ ^ C0S + ^ da ' 

where k = co/c = %r/\. Now suppose we introduce, as a coordinate 
on the sphere, the distance r from the point P; that is, we cut the sphere 
with spheres concentric with P, laying off zones between them, as in 
Fig. 28. We can easily get the area between r and r + dr, and hence 

r x dQ 

r 

P 

Fig. 28.— Construction for Fresnel’s zones on a sphere surrounding the source. 

the element of area. Take as an axis the line joining the source 
and the point P, and consider a zone making an angle between 6 and 
6 + dd with the axis. The area of the zone is 2irr\ sin $ d6. But now 
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by the law of cosines, if B is the distance from the source to P, 

7*2 as r 2 — 2Rr\ COS 0, 

and differentiating, 2r dr = 2Eri sin 0 cZ0. Hence for the area of the 
zone we have dr. Introducing this, we have 


P 

* “KP 


) /* rmax 

J rmin 


6 -jfcr[ CO s (n,r) + 1] dr, 


where = R — n, r MS — B , UT? ., 

To carry out this integration, we use a device called Fresnel s 

zones,” giving us an approximate value in a very elementary way. 
Beginning with r^, we take a set of zones such that the outer edge 
of each corresponds to a value of r just half a wave length greater 
than the inner edge. The contributions of successive zones will almost 
exactly cancel. The integral, then, consists of a sum of terms, say 
8l _ 8> 4- . . • + s n , where the magnitudes of Si, s 2 , * * * , vary only 
slightly from one to the next. Now it is true in general that in such a 
series the sum is approximately half the sum of the first and last 
terms. We can see this as follows: We group the terms 


Si 


+ 


($-« + ?) 




+ 


/ S n —2 

V 2 


311-1 


+ 


2/ 


. $71 

+ 2* 


Now, because of the slow variation of magnitude, we have very nearly 
gfe— i -f~ Sfc+i . t£ this were so, however, each of the parentheses 

*>fc 2 

s i + s n 


would vanish, leaving only- ^ 

In our case, the contribution of the first zone is to be considered, 
but that of the last zone is practically zero, because of the factor 
cos (n,r) + 1, so that the result is half the first zone. Now, in the 
first zone, cos (n,r) + 1 is so nearly equal to 2 that we can take it 
outside the integral, obtaining 




the correct value. 


irj Ae 3 '“ ( *- ri/CJ 
X R 

2 R 

—r i/c) 

R 

12/c) 

R ; 


/ 'Ji-r i 
R — r i 

(e- jkr ) 

0—jk(.R—r i) 


-n+X/2 


R-n + \/2 
R—r\ 


(3.1) 
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In the derivations of this chapter we have traveled in a very 
roundabout way to reach a very obvious result. We naturally ask, 
what is Huygens principle good for, aside from a mathematical 
exercise? The answer is found in problems of diffraction. There 
one has certain opaque screens, with holes in them, and a light wave 
falling on them. If the light comes from a point source, geometrical 
optics would tell us that the shadow of the screen would have perfectly 
sharp edges. But actually this is not true; there are light and dark 
fringes around the edge of the shadow. If the shadow is observed 
at a greater and greater distance, these fringes get proportionally 
larger and larger, until they entirely fill the image of the hole. Finally 
at great distances the fringes grow in size until the resulting pattern 
has no resemblance at all to the geometrical image. There are then 
two general sorts of diffraction: first, that in which the pattern is 
like the geometrical image, but with diffuse edges, and which is called 
“Fresnel diffraction”; secondly, that in which the pattern is so 
extended that it has no resemblance to the geometrical image, and 
which is called “Fraunhofer diffraction.” Both types of diffraction, 
as well as the intermediate cases, can be treated by using Huygens* 
principle. 

4. Huygens’ Principle for Diffraction Problems. —Suppose that 
light from a point source falls on a screen containing apertures, 
and that we wish the amplitude at points behind the screen. Then 
we surround the point P, where we wish the field, by a surface con¬ 
sisting of the screen, and of a large surface, perhaps hemispherical, 
extending out beyond P, and enclosing a volume completely. We 
apply Huygens’ principle to the surface. In doing so we assume 
(1) that the amplitude of the incident wave, at points on the apertures, 
is the same that it would be if the screen were absent; and (2) that 
immediately behind the screen, and at points of the hemispherical 
surface as well, the amplitude is zero, the wave being entirely cut off 
by the screen. This is, of course, an approximation, since at the edge 
of a slit, for example, the amplitude of the wave does not suddenly 
jump from zero to a finite value. The exact treatment is exceedingly 
difficult, but in the few cases for which it has been worked out, it 
substantiates our approximations, if the dimensions of the aperture 
arc large compared with the wave length, showing that they lead 
to a correct qualitative understanding of the phenomena, though 
there are quantitative deviations. 

To find the disturbance at P, then, we integrate over the surface, 
but set the integrand equal to zero, except at the openings of the 
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screen, obtaining 


ip — J i e , 0 ,t *“ (r+r i )/el [ c o s ( n > r ) “■ cos (n,ri)] da, 


the integral being over the openings. We note that only the edges 
of the openings are significant, the shape of the screen away from 
the openings being unimportant. Now let us assume, as is almost 
always true in practice, that the distances n and r, from source to 
screen and from the screen to P, are large compared with the dimen¬ 
sions of the holes. Then 1/rri and [cos (n,r) — cos (n,ri)] are so 
nearly constant over the aperture that we may take them outside 
the integral, replacing r and ri by mean values f and f\. If in addi¬ 
tion we write r + r\ in the exponential as f + fj. + r' + r[, where 
r' and r[ are the small differences between r and ri, and their values 
at some mean point of the aperture, we have finally 


ip = j^[cos {n,r) — cos (n,ri)]e J " lfr " <iH ' Fl)/cl / e~ 3 ' k( - r ' +r ^ da. (4.1) 
* rr i J 


The whole factor outside the integral may be taken as a constant 
factor so that, if we are interested only in relative intensities, we 
may leave it out of account. We finally have a sinusoidal vibration 
of which the amplitudes of the components of the two phases are 
proportional to C' — f cos k(r' + r[) da, and S' — J sin k(r' + r() da. 
Hence the intensity is proportional to C n + S ' 2 , and our task is to 
compute this value. 



Fig. 29.—Fresnel’s zones on a plane. 


By using Fresnel’s zones, one can see qualitatively the explanation 
of the diffraction fringes, particularly in Fresnel diffraction. Suppose 
that we join the source S and a point P with a straight line, as in 
Fig. 29, and consider the point of the screen cut by this line, a point 
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for which r + ri has a minimum value. Let us surround this point 
by successive closed curves in which r ■+* **i differs from its minimum 
value by successive whole numbers of half wave lengths. It is not 
hard to see that these curves will be the intersections with the screen 
of a set of ellipsoids of revolution, whose foci are S and P. Hence 
if the line SP is approximately normal to the screen, the curves will be 
approximately circles. Successive zones included between successive 
curves will propagate light differing by a half wave length from their 
neighbors. Now on the screen we may imagine the pattern of zones, 
and also the apertures. The whole nature of the diffraction depends 
on what zones are uncovered, and can transmit light, and what ones 
are obscured by the screen. We may distinguish three cases, shown 
in Fig. 30: 



Fio. 30.—Fresnel's zones and rectangular aperture. (1) Directly in path of light. 

In geometrical shadow. (3) On edge of shadow. 


( 2 ) 


1. The center of the system of zones lies well inside the aperture. 
The central zone is entirely uncovered, as are a number of the others. 
As we get to larger zones, we shall come to one of which a small part 
is covered; then one that is more covered; and so on, until finally we 
come to one only slightly uncovered; and then the rest are entirely 
obscured. Now we can write our integral, as in Sec. 3, as a sum of 
integrals over the successive zones. As before, these contributions will 
decrease very gradually from one zone to the next. When we reach 
the zones that are obscured, the decrease will become a little more 
rapid, but not so much as to interfere with the argument. We can 
still write the whole thing as half the sum of the first and the last 
zones. In our case, the last zone that contributes has a negligibly 
small area exposed, so that it contributes practically nothing, and the 
whole integral is half the first zone. But this gives just the intensity 
we should have in the absence of the screen. 
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2. The center of the zone system is well behind the screen (P is 
in the geometrical shadow); Then the first few zones are obscured. 
A certain zone begins to be uncovered, until finally some zones are 
uncovered to a considerable extent. Large zones become obscured 
again, however. Thus in our sum, although there are terms different 
from zero, both the first and the last terms are zero, so that the sum 
is zero. The intensity well inside the geometrical shadow is zero. 

3. The center of the zone system is near the edge of the screen. 
Then the first zone may be partly obscured, so that there is some 
intensity, but not so great as without the screen. Or the first zone 
may be entirely uncovered, but the next ones partly obscured. In 
these cases, the contributions from the successive zones may differ so 
much that our rule of taking the first and last terms is no longer 
correct. It is possible for the whole amplitude to be more than half 
the first zone, so that the intensity is actually greater than without 
the screen. As we move into the geometrical image from the shadow, 
it tur n s out that there is a periodic fluctuation, because of the uncov¬ 
ering of successive zones, and this explains the diffraction fringes. 

Problems 

1. Try to carry out exactly the integration that we did approximately by using 
Fresnel’s zones. 

2. The source is at infinity, so that a wave front is a plane. Set up Fresnel’s 
zones, and find the breadth of the nth zone, and its area. 

3. A plane wave falls on a screen in which there is a circular hole. Investigate 
the amplitude of the diffracted wave at a point on the axis, showing that there is 
alternate light and darkness either as the radius of the hole increases, or as the 
point moves toward or away from the screen. {Suggestion; The integral consists 
of a finite number of zones.) 

4. A plane wave falls on a circular obstacle. Show that at a point behind the 
obstacle, precisely on the axis, there is illumination of the same intensity that we 
should have if the obstacle were not there. Explain why this would not hold for 
other shapes of the obstacle. 

6. Take a few simple alternating series, such as, H — M + M M + • ■ • , 
M + — * • • , and so forth, and find whether 

our theorem about the sum of a number of terms is verified for them. In doing 
this, it may be necessary to start fairly well out in the series, so as to satisfy our 
condition that successive terms differ only slightly in magnitude. 

6. Prove the statement that the boundaries of Fresnel’s zones are the inter¬ 
section of the screen with ellipsoids of revolution whose foci are the source and 
the point P. What happens to these ellipsoids as the source is removed to infinity? 

7. A “zone plate” is made by constructing 20 Fresnel zones on a plane glass 
plate for a source at infinity and blocking off the light from every other zone. 
This zone plane, when illuminated at normal incidence by a plane wave, acts as a 
lens and produces a bright spot on its axis at a distance of 1 m from the plate. 



If the incident light is monochromatic and of wave length 5,000 A, compute the 
radius of the zone plate and the intensity at the bright spot relative to its value in 

the absence of the zone plate. At what other points on the axis will there be 
intensity maxima? 

8. Consider a line source of light and a plane observation screen parallel to 
and at a finite distance from the line source. The wave fronts are cylindrical. 
Show that the Fresnel zones on a wave front at a distance d from the source are 




CHAPTER XIV 

FRESNEL AND FRAUNHOFER DIFFRACTION 


In the present chapter we proceed to the mathematical discussion 
of Fresnel and Fraunhofer diffraction, based on the methods of Huy¬ 
gens’ principle derived in Chap. XIII. The problems that we take 
up are Fresnel and Fraunhofer diffraction through a slit; Fraunhofer 
diffraction through a circular aperture; and the diffraction grating, an 
example of Fraunhofer diffraction. In Eq. (4.1) of the preceding 
chapter, we saw that the essential step in computing the diffraction 
pattern is the evaluation of the integral 

j* e -7*(H-r,) 

where k = 2tt/X, and where the integration is over the aperture of 
the screen, da is an element of surface in the aperture, r is the dis¬ 
tance from the source to the element da, and ri the distance from 
the element to the point P where the field is being computed. If 
the incident wave is a plane wave, and the plane of the aperture 
is a wave front, then r is the same for all elements, and the factor 
e~ ikr can be canceled out of the integral. The remaining integral, 
J e -J fcr i da, represents the sum at P of the amplitudes of spherical waves 
of equal intensity and phase starting from all points of the aperture. 
It is the interference of these waves which produces the diffraction 
pattern. 

1. Comparison of Fresnel and Fraunhofer Diffraction.—The two 

types of diffraction, Fresnel and Fraunhofer, arise from observing 
the pattern near to, or far from, the screen. Let the normal to the 
screen be the z axis, as in Fig. 31, and let the screen containing the 
aperture be at 2 = 0. The light passing through the aperture is 
caught on a second screen at z — R. Physically, the diffraction 
pattern has the following nature: close to the aperture, the light 
passes along the z axis as a column or cylinder of illumination, of 
cross section identical with the aperture, so that, if the screen at R 
is close to the aperture, the illuminated region will have the same shape 
a.s the aperture, and we speak of rectilinear propagation of the light. 
As R increases, however, the column of light begins to acquire fluctua¬ 
tions of intensity near its boundaries, so that the pattern on the screen 

180 
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has fringes around the edges. This phenomenon is the Fresnel 
diffraction. 

The size of the Fresnel fringes increases proportionally to the 
square root of the distance R. Thus Fig. 32 shows, in its upper dia¬ 
gram, the slit, parallel column of light, and parabolic lines starting from 
the edges of the slit, indicating the position of the outer bright fringe 
of the Fresnel pattern, if we are sufficiently near to the slit. As R 
becomes larger, the fringes become so large that there are only one 
or two in the pattern of the aperture, and the pattern shows but small 
resemblance to the shape of the aperture, though it still is of roughly 



Fiq. 31.—Xperture and screen for diffraction through rectangular slit. 

the same dimensions. With further increase of R, we finally enter 
the region of Fraunhofer diffraction. Here the beam of light, instead 
of consisting of a luminous cylinder, resembles more a luminous cone 
indicated by the diverging dotted lines in the top diagram of Fig. 32. 
Thus the Fraunhofer pattern becomes larger and larger as R increases, 
being in fact proportional to R, so that we can describe it by giving the 
angles rather than distances between different fringes. 

Often Fraunhofer diffraction is observed, not by placing the screen 
at a great distance, but by passing the light through a telescope 
focused on infinity. Such a telescope brings the light in a given 
direction to a focus at a given point of the field. Thus it separates 
the different Fraunhofer fringes, since each of these goes out from the 
source in a particular direction. In Fig. 32, diffraction patterns are 
shown indicating the transition from Fresnel to Fraunhofer diffrac¬ 
tion. Pattern a illustrates the Fresnel pattern for one edge of an 
infinitely wide slit. Patterns b to g represent the actual diffraction 
patterns from the slit, at distances indicated in the upper diagram. 
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These patterns are all drawn to the same scale. They are drawn, 
from a slit five wave lengths wide, for the sake of getting the figure 
on a diag r am of reasonable scale. If the wave length were shorter, 
then for the s am e slit the distances would be stretched out to the 









(h) 


Fig. 32.—Transition from Fresnel to Fraunhofer diffraction for a slit, (a) Fresnel 
pattern for edge of infinitely wide slit. (b)-(ff) Actual diffraction patterns from slit, at 
distances indicated in upper diagram. ( h ) Fraunhofer pattern. 

right, and the Fraunhofer pattern would correspond to smaller angular 
deflections. This would be necessary to bring the Fresnel cases far 
enough from the slit so that our approximations would be really 
applicable. Finally, in h, we give the limiting Fraunhofer pattern, not 
drawn to scale. 

Let coordinates in the plane of the aperture be x, y, and in the 
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plane of the screen at R let the coordinates be xo, yo, as in Fig. 31. 
Then, if the element of area is at x, y, 0, and the point P at x 0) y 0 , R, 
the distance between them is 

n = a/Oco — xy + (yo — 2/) 2 + # 2 . 

The integration cannot be performed with this expression for n, and 
Fresnel and Fraunhofer diffraction lead to two different approximate 
methods of rewriting 9 * 1 , leading to different methods of evaluating 
the integral. We can see the relation of these two methods most 
clearly from Fig. 33, in which r x is plotted as a function of x 0 — x, 
for the special case where yo — y is zero. The resulting curve is a 
hyperbola. 



I^ig. 33 .—r 1 as function of xo — x. ri = a /(*0 — a:) 2 -1- f2 a . ri iB the distance 
from a point of the apefture to a point on the screen; xo — x is the difference between the 
x coordinates of the points. 

Now in all ordinary cases, R is large compared with the dimensions 
of the aperture. That is, the range of abscissas representing the 
dimensions of the aperture (from xo - to a:o - x 2 , if x\ and x 2 are 
the extreme coordinates of the aperture) is small compared with the 
distance R, the intercept of the hyperbola on the axis of ordinates. 
The two cases are now represented by the ranges ab and cd of abscissas, 
respectively. In the first, xo — Xi and x 0 — x 2 are separately small, 
as well as their difference, and this means that the point P is almost 
straight behind the aperture, in the region where the Fresnel diffraction 
pattern occurs. In the second, xo is large, of the same order of magni¬ 
tude as R, showing that we are examining the pattern at a considerable 
angle to the normal, as we do in the Fraunhofer case. 
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The two approximate methods can now be simply described from 
the curve: for Fresnel diffraction, we approximate the hyperbola near 
its minimum by a parabola; for Fraunhofer diffraction, we approximate 
it farther out by a straight line. In the first case, assuming R to be 
large compared with (Xo — x), we have by the binomial expansion 

„ _ . 1 ( x o — xY , • 

r x = it -+- 2-^-r * , 

or, including the terms in y, 

r t - R + | + (2/0 — + • • • . (1.1) 

In this case, in the notation of Eq. (4.1) of Chap. XIII, we take 
f = R, so that r' is the remaining term of (1.1). For Fraunhofer 
diffraction, on the other hand, we have x 0 x. Then we can write 
r} = (x% 4* y% + P 2 ) — 2(xx 0 4 yyo) 4* x 2 4 y 2 , and we can neglect 
the terms x 2 4 y 2 . If we let R 2 = x 2 4 y\ + R 2 , where R 0 measures 
the distance from the center of the aperture to the point P, we can use 
a binomial expansion, obtaining 

^ — t? _ areo + yyo _ t ON 

T x x£ 0 • (1.2y 

In this case we take f = Ro, so that r' is the remaining term of (1.2). 
Letting x 0 /Ro = l , yo/Ro — m 7 the direction cosines of the direction 
from the center of the aperture to P, we have 

r' = — (Jx 4 my) * * * , 

involving the position on the screen only through the angles, so that 
we see at once that the pattern will travel outward radially from the 
aperture. 

2. Fresnel Diffraction from a Slit. — Let the aperture be a slit, 
extending from x = — (a/2) tor = a/2, and from y = — (b/ 2) to 6/2. 
We assume a to be small, b comparatively large, as in Fig. 31, so that 
it is a long narrow slit. Using the results of (1.1), our integral is 

f e —lhr* dd — / 

This can be immediately factored into 

/ &/2 fa/2 

e —irjCy—2 /oW-KX fiy I e —Tri(,x—x a )i/rt\ rfx. 

-6/2 J -a/2 

Since these two integrals are of the same form, we can treat just one 
of them. This will prove to give fringes parallel to one set of axes. 
The whole pattern is then simply the combination of the two sets of 
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fringes. The single integral, for instance, the one in x, has a real part, 
and an imaginary part (with sign changed), equal to 


ra/2 

J -a/ 


ir(x — Xo ) 2 7 
cos —— r——dx 

a/2 KK 


and 


ra/2 

/ sin 
J —a /2 


7r(x — Xo)‘ 

R\ 


dx. (2.1) 


It is customary in these integrals to make a change of variables: 
(x — xo) 2 


2 ” Then the integrals become s/R\/2 times C and S, 

ru, r U i 

— / cos jz u 2 du, S = I sin u z du, and where 

J ui & j Ul z 


R\ 

respectively, where C 

xo — a/2 xo -+- a/2 

u i = -—-j====^ -j uz — ■ -~^===~ -• These integrals are called Fresnel s 

integrals.” They cannot be explicitly evaluated, but their values 
have been computed by series methods. 

J* sin-|irw 2 du 




,/* cos du 


+ -0.5 


Fkj. 34.—Oornu's spiral. Tho points of the spiral marked by cross bars correspond to 

increments of 0.1 unit in u. 

To discuss Fresnel’s integrals, let us plot the indefinite integral 

fu Cu 

J o cos ^ u 2 du as abscissa, J sin ^ u2 du as ordinate, of a graph, as in 
Fig. 34. Then it is not hard to see that the resulting curve is a spiral, 
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which is known as “Cornu's spiral." To see this, we can first compute 
the slope. This is the differential of the ordinate, over the differential 
of the abscissa, or 


sin (tt/ 2)^ 2 
cos (tt/2)u z 


tan ^ u 


2 


Thus, when u 2 increases by 4, the tangent of the curve swings around 
a complete cycle, and comes back to its initial value. Each point 
of the spiral corresponds to a particular value of u. We can show 
at once that the difference of u between two points is simply the length 
of the curve between the points. We show this for an infinitesimal 
element of the curve. The square of the element of length, ds 2 , is 
equal to the sum of the squares of the differentials of abscissa and 
ordinate, or is cos 2 [(x/2)ti 2 ] du 2 + sin 2 [(x/2)w 2 l du 2 . Hence ds — du , 
and we can integrate to get s = u% — u\. From this fact we can make 
sure of the spiral nature of the curve. For one turn corresponds 
to an increase of u 2 by 4. That is, if u' and u" are the values at 
the two ends, u" 2 — u' 2 + 4. This is u" 2 — u' 2 = 4, ( u" — u') 
(u" + u') — 4, u n — v! = 4 / (u" + u'). The difference u" — u' is, 
however, simply the length of the turn, so that as we go farther 
along, the turns become smaller and smaller, so that they eventually 
become zero, which is characteristic of a spiral. It is plain that the 
spiral is symmetric in the origin, having two points, for u = ± oo, 
for which it winds up on itself. 

Let us take our spiral, mark on it the positions U\ and u% correspond¬ 
ing to the limits of our integral, and draw the straight line connecting 
these points. The length of this line will then be proportional to the 
amplitude of the disturbance, and its square to the intensity. This is 
easy to see: the horizontal component of the line is just C, and the 
vertical component S, so that the square of its length is C 2 + S 2 . 
Knowing this, we can easily discuss the fluctuations of intensity, as 
seen in Fig. 32. As xo changes, it is plain that U\ and increase 

together, their difference remaining fixed and equal to — - q Thus 

a/RX/2 

essentially we have an arc of this length, sliding along the spiral, and 
the intensity is measured by the square of the chord between the ends 
of this arc. 

Now when xo is large and negative, the arc is wound up on itself, 
so that its ends practically meet, and the intensity is zero. This is 
the situation in the shadow. As xo approaches the value — a/ 2, how¬ 
ever, U 2 approaches zero, so that one end of the arc has reached the 
center of the figure. There are two quite different cases, depending 
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on whether u 2 — U\ is large or small. If it is large (a large slit and 
relatively short distance R and small wave length), then Ui will not 
be unwound much at this point. The chord will then be half the 
value between the two end points of the spiral, and the intensity 
will be one-fourth its value without the screen, and will have increased 
uniformly in coming out of the shadow. As we go farther along the x 
direction, however, the arc will begin to wind up on the other half of 
the spiral, producing alternations of intensity at the end of the shadow. 
Then for a while u 2 will be nearly at one end of the spiral, u x at the 
other, so that the intensity for some distance will be nearly constant, 
and the same that we should have without the slit. This is the 
illuminated region directly behind the slit. Finally we approach the 
other boundary, and u\ commences to unwind. We then go through 
the same process in the opposite order. 

The other quite different case comes when w 2 — u x is small, which 
is the case for small slit, or large wave length or distance. Then 
there is never a time when u x is on one branch of the spiral and u% on 
the other. All through the central part of the pattern, therefore, 
there are no fluctuations of intensity. Such fluctuations come only 
far to one side or the other. They come about in this way: At some 
places in the pattern, the arc is long enough to wind up for a whole 
number of turns, and the chord is practically zero, while at other places 
it winds up for a whole number plus a half, and the chord has a maxi¬ 
mum. The resulting fringes are the Fraunhofer fringes which we 
shall now discuss by a different method. 

3. Fraunhofer Diffraction from a Slit.—Using the approximation 
(1.2), our integral for Fraunhofer diffraction is e~ ikR °Je 27ri<ilx+ ' m ' v ' >/x da. 
The first term, as in Fresnel diffraction, contributes nothing to the 
relative intensities, and may be discarded. We then have 


as the integral whose absolute value measures the amplitude of the 
disturbance. Let us suppose that the aperture is the same sort of 
rectangle considered above, extending from —a/2 to a/2 along x, from 
— 6/2 to 6/2 along y. Then the integral is 


fa /2 

J -a/2 


e 2icjlx/\ rfx 


fb/2 

/ e 2rfmv/\ 

J —b/2 


_ (c 


vjla/X _ 


^— 7 rjla/X^ ^>wj7nb/\ q — 7cjmb/\^ 


2rjl/\ 


2iTrjtn/\ 

sin (xZa/X) sin ( wmb/X) 


irl/\ 


7T 


m/X 


(3.1) 
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The intensity is the square of this quantity. Let us consider its 
dependence on the position of the point P on the screen. The coordi¬ 
nates of this point enter only in the expressions l, m, showing that the 
pattern increases in size proportionally to the distance, as if it con¬ 
sisted of rays traveling out in straight lines from the small aperture, 
rather than having an approximately constant size as with the Fresnel 
diffraction (see Fig. 32). When we consider the detailed behavior of 
the intensity as a function of the angle, we find that this can be written 


as a 2 


sin 2 (irla/X) 
(tt la/\y 


times a similar function of m, giving a curve of the 


form 


sm 2 a . 7 via 

- 5 —j where a — —— 

a 2 A 


This function becomes unity when a 



goes to zero for a — ic, 2 tt, 3i r, • • • , with maxima of intensity approxi¬ 
mately midway between. The, maxima decrease rapidly in intensity. 
Thus at the points 3tt/2, 5tt/ 2, . . . which are approximately at the 
second and third maxima, the intensities are only (2/3 tt) 2 , (2 /57 t) 2 , 
. . . , or 0.045, 0.016 . . . , compared with the central maximum of 1. 
Let us see how the size of the fringes depends on the dimensions of the 
slit. The minima come for a. = mr, or la/\ — n, l = rik/a. Thus we 
see that the greater the wave length, or the smaller the dimensions 
of the slit, the larger the pattern becomes. 



Fig. 35.—Elementary construction for Fraunhofer diffraction. 


The positions of the minima can be immediately found by a very 
elementary argument. Assume for convenience that we are investi¬ 
gating the pattern at a point in the xz plane, so that m = 0. Then 
draw a plane normal to the direction l, passing through one edge of 
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the aperture, as in Fig. 35. This represents a wave front of the 
diffracted wave, just as it passes one edge of the aperture. From 
the geometry of the system, this wave front is a distance la from 
the other edge, or la /2 from the middle of the aperture. Now, if the 
distance of the middle is just a whole number of half wave lengths 
different from the distance from the edge, the contributions of these 
two points to the amplitude will just cancel, being just out of phase. 
The other points of one half of the aperture can all be paired against 
corresponding points of the other half whose contributions are just 
out of phase, finally resulting in zero intensity. This situation comes 
about when la /2 = nX/2, where n is an integer, or l — n\/a, the same 
condition found above. Since most of the intensity falls within the 
first minimum, and since l is the sine of the angle between the ray 
and the normal to the surface, we may say that by Fraunhofer diffrac¬ 
tion the ray is spread out through an angle X/a. 

4. The Circular Aperture, and the Resolving Power of a Lens.— 
The problem of Fraunhofer diffraction through a circular aperture 
is slightly more complicated mathematically. Here we must evaluate 
/ e 2vi(l * +7nv)/>k da over a circle. Let us introduce polar coordinates in 
the plane of the aperture, so that x — p cos 6, y ~ p sin 0. Further, 
on account of symmetry, we may take the point P to be in the xz plane, 
so that m — 0. Then, if p 0 is the radius of the aperture, the final result 

is p dO p e* rJl> U0H el/X p dp. We integrate with respect to p by parts, 

obtaining for the integral 



pae* 9 ** OOH 0l/X 

2k j cos 0l/\ 


^£>2tT 7p 0 coh 9l/\ 2^ 

(27 rj cos 0l /\) 2 


For the integration with respect to 6, it is necessary to expand the 
exponentials in series. If we do this, the integrals are in each case 
integrals of a power of cos 0, from 0 to 27r. These are easily evaluated, 
and the result, combining terms, proves to be 



where Jc is an abbreviation for 7rpol/\. If we recall the formulas for 
BessePs functions, discussed in Appendix VII, we can see without diffi¬ 
culty that this is equal to (poX/Z) J i[27rpo(Z/X)]. It is not hard, using 
some of the properties of BessePs functions, to prove this formula 
directly, without the use of series. 
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From the series, we see that the intensity has a maximum for 
1 = 0, the center of the pattern. As l increases, we can see the 
behavior most easily from the expression in terms of Bessel's functions. 
Since J 1 has an infinite number of zeros, there are an infinite number 
of light and dark fringes. The first dark band comes at the first zero 
of J i, which from tables is at 2irpol/\ = 1.2197*71-, pol/k = 0.61. The 
next is at pol/k — 1.16, and so on, with maxima between. We see 
that, except for a numerical factor, the pattern from a circular aperture 
has about the same dimensions as that from a square aperture. Thus 
if the side of the square were equal to the diameter of the circle, 2p 0 , 
the first dark fringe would be at 2 pol/'k = 1, pol/'k = 0.5, and the next 
one at 1.0. 

Whenever light passes through a lens, it is not only refracted, but 
it has passed through a circular aperture, the size of the lens itself 
or of the diaphragm that stops it down, and as a result it is diffracted. 
Suppose, for example, that the lens is the objective of a telescope, and 
that parallel light falls on it, as from an infinitely small or distant star. 
Then after passing through the diaphragm, the light will no longer be 
a plane wave, but will have intensity in different directions, as shown 
in the last paragraph. The central maximum will have an angular 
radius of 0.61\/po, where po is now the radius of the telescope objective. 
The resulting waves are just as if the light came from an object of this 
size, but passed through no diaphragm. When the telescope focuses 
the radiation, the result will be not a single point of light, but a 
circular spot surrounded by fringes, as of a star of finite diameter. 
For this reason, the telescope is not a perfect instrument, and .one 
Would say that its resolving power was only enough to resolve the 
angle 0.61\/po. This is usually taken to mean the following: If two 
stars had an actual angular separation of this amount, the center 
of the image of one star would lie on the first dark fringe of the other, 
and the patterns would run into each other so that they could be just 
resolved. We see that the larger the aperture of the telescope, or the 
smaller the wave length, the better is the resolution. The same general 
situation holds for microscope lenses. 

5. Diffraction from Several Slits; the Diffraction Grating. —Sup¬ 
pose we have a number N of equal, parallel slits, equally spaced. 
•Let each have a width a along the x axis, and let the spacing on centers 
be d, so that the centers come at x = 0, d, • • • (N — l)d. Now let 
us find, the Fraunhofer pattern. The part of the integral depending 

on y will be just as with the single slit, and we leave it out of account. 
We are left with 
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e 2irjlx/\ dx 


+ 


rd+a/2 
J d — a/2 


G 2 wjlx/\ 


/ 


(JV— l)d+a/2 
(N-l)d—a/2 


e 1irilx/\ 


But this is, as we can immediately see, simply 



e 2vjlx/\ _|_ e 2irjld/\ e 2vjl2d/\ 6 2ir?7(JV— Dd/Xj^ 


By the formula for the sum of a geometric series, this is 



e 2TCjlx/\ fa 


( 


\ _ g2xj7iVd/X\ 

_ ^ 2 irjld/X J 


Let the first term be A , the amplitude due to a single slit, which we 
have already evaluated. Now to find the intensity we multiply this 
by its conjugate, which gives 


4 2 1 — cos (27 rlNd/N) _ . 2 sin 2 (irlNd/X) 
1 — cos (2tt ld/\) sin 2 (irld/\) 


(5.1) 


That is, with N slits the actual intensity is that with one slit, but 
multiplied by a certain factor. This factor goes through zero when 
lNd/\ is an integer, so that l equals an integer multiplied by \/Nd. 
This gives fringes with a narrow spacing, characteristic of the whole 
distance Nd occupied by the set of apertures, modulating the other 
pattern, and they are what are usually called “interference fringes,” 
since they are due, not to diffraction from a single aperture, but to 
interference between different apertures. But, in addition to this, the 
denominator results in having these fringes of different heights. The 
minimum height occurs when the denominator equals unity, when 
the fringes are of height A 2 , and the most intense fringes come when 
the denominator is zero. Here the ratio of numerator to denominator 
is evidently finite, and gives fringes of height N 2 A 2 . Thus the greater 
N is, the greater the disparity in height between the largest and small¬ 
est maximum. Evidently every Nth maximum will be high, and the 
high ones will be spaced according to the law ld/\ = k, an integer. 

Now suppose N becomes very great, as in a diffraction grating. 
Then the small maxima will become so weak compared with the 
strong ones that only the latter need be considered. The latter will 
seem to consist of a set of sharp lines, with darkness between. These 
sharp lines come, as wo have seen, at angles 6 to the normal given by 
k\ = d sin 8, where k is an integer, and sin 8 = 1. This is the ordinary 
diffraction grating formula, where k is 0 for the central image, 1 for 
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the first-order spectrum, 2 for the second order, and so on. But we 
cannot entirely neglect the fact that there are other small maxima 
near the important ones. Thus for ld/\ — k, the intensity is N 2 A 2 . 
This comes for lNd/\ = Nk. But for lNd/\ = Nk + we again 
have a secondary maximum, whose height is now 

A 2 A 2 _ A 2 

sin 2 (irZd/X) = sin 2 ir[(Nk + M)/N] = sin 2 ir[k + 3/(2iV)j‘ 


Now sin 2 (rf + H) = (||) 


approximately, if N is large, so that 


the height of the maximum is 4iV 2 A 2 /97r 2 , or about 0.045 of the height 
of the highest maximum. Thus the first few secondary maxima 
cannot be neglected. 

To get an idea of the width of the region through which the inten¬ 
sity is considerable, we may take the width of the first maximum. 
From the center to the first dark fringe, this is given by the fact that 
at the center lNd/\ = Nk, at the dark fringe Nk + 1, so that 
A l = \/Nd. This is closely connected with the resolving power of a 
grating. For a single frequency gives not a sharp set of lines, one for 
each order, but a set broadened by the amount we have found. Thus 
two neighboring frequencies, differing by AX, could not be resolved 
if the first minimum of one lay opposite the maximum of the other. 
Since l = \k/d, this would be the case if A l = A \k/d — \/Nd, or if 
AX/X = 1 /Nk. The resolving power thus increases as the number of 
lines in the grating increases, and as the order of the spectrum increases. 


Problems 

1. Carry through a discussion of Fresnel diffraction from a slit, when the 
source is at a finite distance, directly behind the center of the slit. In what ways 
will the result differ from the case we have discussed? 

2. Light of wave length 6,000 A falls in a parallel beam on a slit 0.1 mm broad. 
Work out numerical values for the intensity distribution across the slit, at three 
distances, first in which the Fresnel fringes are small compared with the size of the 
pattern, second in which they are of the same order of magnitude, and third in 
which they are Fraunhofer fringes. Either construct Cornu’s spiral yourself, from 
tables of Fresnel’s integrals, or use the one of Fig. 34. 

3. Find the coordinates of the points at which Cornu’s spiral winds up on itself. 
From the chord between these points, compute the intensity behind an infinitely 
broad slit, which essentially means no slit at all. Find whether this agrees with 
what you should expect it to be. 

4. Prove that the maxima of the function 

sin 2 (-7rZa/X) _ sin 2 a 
(■jtZo/X) 2 a 2 
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are determined by the equation a — tan a. Find the first three solutions of this 
transcendental equation, and compare them with the approximate solutions 
a =*■ 3ir/2, 5ir/2, 7vr/2>. 

5. Discuss the Fresnel diffraction pattern caused by an edge coincident with the 
y axis, the screen occupying one-half the xy plane. The diffraction pattern is 
obtained in a plane parallel to the xy plane and a distance R from it. Plot the 
variation of intensity of light along the x direction from a region inside the shadow 
to well into the directly illuminated area. Prove that the intensity of light just 
at the edge of the geometrical shadow is one-fourth of its value if there were no 
diffraction edge. 


f U Tj- f U 1V 

6. Evaluate the Fresnel integrals / cos ^ w 2 du and / sin ^ u 2 du in a power 
series. What is the range of convergence of these series? 

7. Consider the Fresnel integrals in the form f cos x 2 dx and f sin x 2 dx. 


Integrate each of these by parts according to the scheme 



cos x* dx = x cos x 2 + 2 



a; 2 sin x 2 dx 


and continue this process. Show that one obtains series for the above integrals 
of the form 


i: 


cos x % dx = S\ cos x 2 ■+■ Si sin a: 2 


and 


J ^ sin x 2 dx = Si sin a; 2 — S., cos x 2 


where Si and S 2 are power series in x. Find these series and their range of 
convergence. 

8. Find a semiconvergent series for the Fresnel integrals by the following 

r qo r o° 

method: Write / cos x 2 dx — / - x dx, and integrate by parts, repeating 

Jx Jx x 

the process. Then write the results in the form 



cos a; 2 dx — <r \ cos x 2 


— <t 2 sin a; 2 + R c 


where <rx and <ra are finite series of n terms each in inverse powers of x. R c is the 
remainder. Show that it is given by 


_ 1 ■ 3 • 5 • • ■ (4 n — 1) I “ cos a: 2 dx 

R r - (- 1 )" 2 ^ Jx x*“ 

Note that the series <ri and <r-i are alternating series with terms that initially decrease 
with increasing n but eventually increase again. Thus the best approximation 
will be obtained when one stops at such a value of n that the ratio of the remainder 
when n - f- 1 terms are retained to that for n terms is most nearly equal to unity. 
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APPENDIX I 
VECTORS 


The study of vectors and vector operations involves two branches: 
vector algebra, including the addition and multiplication of vectors, 
and the relations between the components of vectors in different 
coordinate systems; and vector analysis, including the differential 
vector operations, corresponding integral relations, and the general 
theorems concerning integrals. We shall treat both subdivisions in 
the present appendix. 

Vectors and Their Components. —We shall denote a vector, a 
quantity having direction as well as magnitude, by bold-faced type, as 
F. Vectors are often described by giving their components along 
three axes at right angles, as F X) F y , F z . Their mathematical relation¬ 
ships are conveniently stated in terms of their components. Thus 
their law of addition is the paral¬ 
lelogram law; in terms of compo¬ 
nents, this means that, if two 
vectors F and G have components 
F x , F V) F z , and G x , G y , G Z} respec¬ 
tively, the components of the sum 
F + G are (F x + G x ), (F v + G v ), 

(, F e + Gz), as we show graphically 
in Fig. 36. To multiply a vector 
by a constant, as C, the vector 
must be increased in length by the 
factor Cj leaving its direction un¬ 
changed; this amounts to multi¬ 
plying each component by the 
constant C , so that the compo¬ 
nents of CT are CF X , CF V , CF Z . 

Often a constant like C is called a 
“scalar/’ to distinguish it from a vector. A scalar is a quantity that 
has magnitude but not direction, a vector having both magnitude and 
direction. 

It is often useful to write vectors in terms of three so-called “unit 
vectors,” i, j, k. Here i is a vector of unit length, pointing along the 

195 



Fio. 30.—Parallelogram law for addi¬ 
tion of vectors. The voctor F + G, the 
diagonal of the parallelogram of sides F, 
G, is the vector sum of F and G. Its x 
component is F x + G x , its y component 
F v -h G v . 
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x axis, and. similarly j has unit length, and points along the y axis, and 
k along the z axis. Now we can build up a vector F out of them, by 
for min g the quantity iF x -j- j F y + kF z . This is the sum of three 
vectors, one along each of the three axes; and the first, which is just 
the component of the whole vector along the x axis, is F x , and the 
other components likewise are F v and F z . Thus the final vector has 
the components F X} F v , F z , and is just the vector F. 

By the magnitude of a vector we mean its length. By the three- 
dimensional analogue to the Pythagorean theorem, by which the 
square of the diagonal of a rectangular prism is the sum of the squares 
of the three sides, the magnitude of the vector F equals \/F x -+* Fy + j F\. 
We often speak of unit vectors, vectors whose magnitude is 1. The 
component of a vector in a given direction is simply the projection 
of the vector along a line in that direction. It evidently equals 
the magnitude of the vector, times the cosine of the angle between 
the direction of the vector and the desired direction. As a special 
example, the component of a vector F along the x axis is F x , and this 
must equal the magnitude of F, times the cosine of the angle between 
F and x. If this angle is called (F,x), then we must have 


cos (. F,x ) 


F m 

VF% + FI + FI 


with similar formulas for y and z components. The three cosines of 
the angles between a given direction, as the direction of the vector 
F, and the three axes, are called “direction cosines,” and are often 
denoted by letters l , m, n, so that in this case we have l = cos (. F,x ), 
etc. It follows immediately that l 2 + m 2 + n 2 = 1. We can give a 
simple interpretation of the direction cosines of any direction: they 
are the components of a unit vector in the desired direction, along the 
three coordinate axes. 

Scalar and Vector Products of Two Vectors.—Multiplication of 
two vectors is a somewhat arbitrary process, governed by rules that 
we must postulate. It has proved to be convenient to define two 
entirely independent products, called the “scalar product” and the 
“vector product.” We shall first consider the scalar product. The 
scalar product of two vectors F and G is denoted by F • G, and by 
definition it is a scalar, equal to either (1) the magnitude of F times 
the magnitude of G times the cosine of the angle between; or (2) the 
magnitude of F times the projection of G on F; or (3) the magnitude 
of G times the projection of F on G. From the preceding section we 
see that these definitions are equivalent. 
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It is often useful to have the scalar product of two vectors in terms 
of the components along x, y, and z. We find this by writing in terms 
of i, j, and k. Thus we have 

F • G = (iF x + j F y + k F.) • (i G x + }G y + k G.) 

= (i • W, + (i • j )F x G y + (i • )s.)F x G z + (j . i)F y G m 
+ (j • j )F y Gy + (j * k )F y G z + (k • i)F z G x + (k - j )F Z G V + (k • k )F Z G Z . 

But, by the fundamental definition, 

i*i — j*j = k • k = 1, 

i. j = j . i = j . k = k . j = k • i = i . k = 0. (1) 

Thus ' 

F • G = F X G X + F v G y + F Z G Z . (2) 

The scalar product has many uses, principally in cases where we are 
interested in the projection of vectors. For example, the scalar 
product of a vector with a unit vector in a given direction equals 
the projection of the vector in the desired direction. The scalar 
product of a vector with itself equals the square of its magnitude, 
and is often denoted by F 2 . The scalar product of two unit vectors 
gives the cosine of the angle between the directions of the two vectors. 
To prove that two vectors are at right angles, we need merely prove 
that their scalar product vanishes. 

The vector product of two vectors F and G is denoted by (F X G), 
and by definition it is a vector, at right angles to the plane of the 
two vectors, equal in magnitude to either (1) the magnitude of F times 
the magnitude of G times the sine of 
the angle between them; or (2) the 
magnitude of F times the projection 
of G on the plane normal to F; or 
(3) the magnitude of G times the 
projection of F on the plane normal 
to G. We must further specify the 
sense of the vector, whether it points 
up or down from the plane. This is 
shown in Fig. 37, where we see that 
F, G, and F X G have the same rela¬ 
tions as the coordinates x, y, z in a 
right-handed system of coordinates. 

Another way to describe the rule in words is that, if one rotates 
F into G, the rotation is such that a right-handed screw turning in 
that direction would), be driven along the direction of the vector 



Fig. 37.—Direction of the vector 
product. 
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product. From this rule, we note one interesting fact: if we inter¬ 
change the order pf the factors, we reverse the vector. Thus 

(F X G) = -(G X F). 

We can compute the vector product in terms of the components, 
much as we did with the scalar product. Thus we have 

F X G = (i X i )F X G X + (i X j )F x G y + (i X k )F X G Z 

+ (j X i )F y G x + (j X j )F v G y + (j X k )F v G e 

+ (k X i )F z G m + (kX j )F z G y + (k X k )F Z G X . 

But now, as we readily see from the definition, 

(i X i) = (j X j) = (k X k) = 0 

(as, in fact, the vector product of any vector with itself is zero),- and 

(i X j) = — (j X i) = k, (j X k) = - (k X j) = i, 

(k X i) = — (i X k) = j. (3) 

Hence, rearranging terms, we have 

F X G = i (F V G Z - F z G y ) + j (FzG x - F X G S ) + k (F x G y - F y G x ). (4) 

The Differentiation of Vectors. —We have seen that there are at 
least three processes of multiplication involving vectors: the multi¬ 
plication of a vector by a scalar, the scalar product of two vectors, 
and the vector product of two vectors. In a somewhat similar way 
there are a number of differential operations involving vectors, all 
with their special uses. The simplest of these is the differentiation 
of a vector with respect to a scalar, such as the time, in which the 
x , y, and z components of the time derivative of a vector are simply 
the time derivative of the x, y, and z components of the vector. 

The next type of differentiation that we consider is the differentia¬ 
tion of a scalar with respect to x, y, and z , to give a vector, the gradient, 
which we encountered in Chap. I, Sec. 3, in discussing the relationship 
between the electric field E and the scalar potential. The gradient 
of a potential tp is defined as 

grad„ = (iA + j ^ + k |)^. 

Here we have written the gradient symbolically as the product of a 
vector operator, and the scalar <p. This vector operator is ordinarily 
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denoted by a special symbol V (pronounced “ del ”), defined by 


V = 




(5) 


Whenever we use the operator V, we understand that the differentia¬ 
tions are to operate on whatever appears to the right of the operator. 
With this definition, we see that we may write the identity 

grad (p = V<p. 


In some texts on vectors, the gradient is simply denoted by v<p* 
As we see in Chap. I, Sec. 3, the physical significance of the gradient 
is simple. The gradient of a scalar points at right angles to the 
surfaces on which that function is constant, or in the direction in 
which the function changes most rapidly with position, and its mag¬ 
nitude measures the rate of change of the function in that direction. 
Its component in any direction, often called the “directional deriva¬ 
tive” of the function in that direction, measures the rate of change 
of the function in that direction, as for instance d<p/dx measures the 
rate of change of <p with x. 

Using the vector operator V, we can now define two types of 
derivatives of a vector: if we have a vector F, we can define derivatives 
by taking the scalar or the vector product of V with the vector, result¬ 
ing in V • F and V X F. These are the quantities defined as the 
divergence and the curl. From the definition (5), we have 


and 
curl F 



There are also various vector operations involving second derivatives. 
The most familiar ones involve the operator V * V = V 2 , which written 
out is 


V 2 = 


a 2 

dx 


a 2 


a 1 i «> "I 
.2 a?y“ 


dz 2 


This is the operator that we encounter often in the wave equation, 
Laplace’s equation, etc., and that is often called the “Laplacian,” 
for that reason. Being a scalar operator, it can be applied to either 
a scalar or a vector, and both forms frequently occur. Another 
second differential operator that is encountered often in electromag- 
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netic theory, though not often in mechanics, is the vector operator 
curl curl, applied to a vector: curl curl F = V X (V X F). We prove 
in a problem the useful relation 


curl curl F = grad div F — V 2 F, 

reducing this operator to the Laplacian, which we have already met, 
and to grad div F = V(V • F). This completes the list of the vector 
operations that are often encountered. 

The Divergence Theorem and Stokes’s Theorem.—There are two 
vector theorems involving integrals, somewhat similar in principle to 
the theorem regarding integration by parts in calculus, which are of 
great importance in vector analysis. These are the divergence 
theorem, sometimes known as Gauss’s theorem, and Stokes’s theorem. 
We shall now prove these theorems. The divergence theorem relates 
to a closed volume V in space, and the surface S that bounds it. It is 
assumed that there is a vector function of position F. The theorem 
then states that the surface integral of the normal component of F, 
over the surface S, equals the volume integral of div F, over the vol¬ 
ume V. That is, 




F • n da 


US div F dv , 
v 



where n is unit vector along the outer normal, so that F • n is another 
way of writing F n , the component of F along the outer normal. To 
prove our theorem, we start by dividing up the volume V into t hin 



Fig. 38.—Construction for the divergence theorem. 


elements bounded by planes y — constant, z — constant, as in Fig. 38. 
The component F x will be a function of x along such an element, and 
we have obviously 



(7) 


where X\ } are the values of x at which the element cuts through 
the surface S. Let ni, n 2 be the outer normals at these two ends of the 
element, and let dai, da^ be the corresponding areas of surface bounding 
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the ends of the element. We shall have 


—da i cos (n\,x) = dy dz 
da% cos (n 2 ,x) = dy dz 


where (ni } x) and (n 2 ,x) refer to the angles between the vectors n x and 
n 2 and the x axis, where the negative sign in the first equation arises 
because the outer normal has a negative projection on the x axis 
at the end X\, and where dy, dz represent the thickness of the element 
along the y and z directions. 

Multiplying both sides of (7) by dy dz, we then have 

y "* 2 dF 

dx dy dz — [F x i cos (n x ,x) da-i + F x2 cos (n 2 ,x) da*]. 

xi vX 


If we carry out a summation over all elements of this type, the integral 
on the left will become a volume integral over V, the sum on the right 
will become a surface integral over S, and we have 




F x cos ( n,x ) da. 


( 8 ) 


We now proceed similarly with y and z, breaking up the volume into 
thin elements with their axes along y and z, and obtain two other 
equations similar to (8). Adding them, we have 



BFv 

dy 


+ 



dv 



[F x cos ( n,x) + F v cos ( n,y ) + F z cos (n,z)] da. 


But the integrand on the right is just the scalar product F • n, so that 
we have proved our theorem (6). This theorem, the divergence 
theorem, is met in Poisson’s equation, the equation of continuity, and 
other places, and forms the basis of Green’s theorem, which is proved 
in Sec. 4, Chap. II. 

A simplified version of our proof, though not entirely satisfactory, 
is instructive. This comes from proving the theorem for a volume 
consisting of an infinitesimal rectangular parallelepiped, bounded by 
x, x + dx, y, y + dy, z, z + dz. First we compute the surface integral 
of F n over the six faces of the volume. For the face at x, F n is — F X) 
and the integral is —F x (x) dy dz, where the value of F x is to be com¬ 
puted at x. For the face at x + dx, F n is F x , and the integral is 
F x (x + dx) dy dz. Thus the surface integral over these two faces is 
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approximately (dF a /dx ) dx dy dz. Adding similar contributions from 
the faces normal to the y and z axes, we find for the total surface 
integral the amount div F dx dy dz , which is what the divergence 
theorem would give. This treatment suggests a simple physical 
definition for the divergence of a vector: it is the total outward flux 
of the vector per unit volume. To get from this form of the theorem 
to that involving a finite volume, we may subdivide the finite volume 
into infinitesimal parallelepipeds. The total flux outward of the vector 
through the surface of the finite volume is just equal to the sum of the 
fluxes outward over the surfaces of the infinitesimal volumes; for at 
each interior surface of separation between infinitesimal volumes, the 
flux outward from one volume is just balanced by the flux into its 
neighbor, leaving only the contributions of the exterior surfaces. The 
weakness of this proof lies only in the fact that a real volume cannot be 
built up entirely of infinitesimal parallelepipeds; around the boundary 
there would have to be infinitesimal volumes with surfaces inclined 
to the coordinate planes, which this treatment does not consider. Our 
earlier treatment, however, removes this objection. 

Next we consider Stokes's theorem. This theorem relates to a 


closed line L in space, and a surface S that is bounded by L. Again 
we have a vector function of position F. Stokes's theorem states 
that the line integral of the tangential component of F, around L, 
equals the surface integral of the normal component of curl F, over S. - 
That is, 



• ds = 


// 

5 


(curl F) 


• n da, 


(9) 


where ds is a vector element of distance around the boundary, and n 
is the normal to the surface, chosen to point up if the positive direc¬ 
tion of traversing the boundary is chosen (that is, if we go around 
it in a counterclockwise direction). We shall give a simplified dis¬ 
cussion of this theorem, similar to the second method we used for 
handling the divergence theorem. Let us subdivide the surface S 


into a set of small approximately 



Fig. 39.—Surface for discussing Stokes’s 
theorem. 


rectangular areas. We shall prove 
the theorem for each of the sepa¬ 
rate areas, and then combine them. 
As we see from Fig. 39, the contri¬ 


butions to the line integrals from the 
interior boundaries will cancel, so that the sum of all the line integrals 
will equal the line integral around the perimeter. The sum of the 
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surface integrals over the separate rectangles will clearly equal the 
surface integral over the whole surface. Thus by adding the separate 
contributions we arrive at the theorem (9), which we wish to prove. 
The only reservation is that the line integral is computed for a saw¬ 
tooth type of curve approximating the actual boundary. It is not 
hard to show, though we shall omit it, that the line integral /F • ds 
over a saw-tooth curve approximating the actual curve sufficiently 
smoothly differs by a negligible amount from the integral over the 
actual curve, so that we can construct a rigorous proof of the theorem 
without trouble. 


We must then prove our theorem (9) for a small rectangular area. 
Let us choose the x and y axes to point along 

the sides of the rectangle, the z axis along the x ’ x x + dx > y *~ dy 

normal n; if we can prove the result in this 

coordinate system, it will have to hold in any A 

other coordinate system as well. The rectan- I_>_ 

gle is considered to be bounded by the values x,y x+dx, y 


x, x + dx, y, y + dy, as in Fig. 40. The 
surface integral on the right of (9) is then 


Fio. 40.—Circuit for prov¬ 
ing Stokes’s theorem. 


( dFy/dx — dF x /dy ) dx dy. Let us next compute /F • ds for the element 


of area. It is evidently 


F X (x,y) dx + F y {x + dx,y) dy 


F*(x,y + dy) dx — F v (x,y) dy 


(dFy _ 

\dx dy J 


dx dy, 


if we go around so as always to keep the surface on the left. Thus the 
theorem (9) is true for the infinitesimal rectangular surface, and by 
the argument above it then holds for a finite surface as well. 

The physical meaning of the curl is easily seen from Stokes’s 
theorem: it is the line integral of the tangential component of the 
vector around an area, per unit area. A vector field has a curl when 
its lines of force close on themselves, like the magnetic lines around a 
wire carrying an electric current, or like the lines of flow in a whirlpool¬ 
like motion of a fluid. A simple example, which makes the significance 
of the curl very clear, is the vector velocity associated with the rigid 
rotation of a body. Let a body have an angular velocity co about the 
z axis. Then the linear velocity at point x, y, z is given by v x = — coy, 
v v = cox, v z = 0. We compute the curl, and find at once that it is 
along the z direction, has a constant value independent of position, 
and is of magnitude 2 co. Thus it is simply twice the angular velocity 
vector of the rotating motion. 
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Problems 


1. Find the angle between the diagonal of a cube and one of the edges. (Hint: 
Regard the diagonal as a vector i + j + k.) 

2. Given a vector i + 2j + 3k, and a second i — 2j + ak, find a so that the 
two vectors are at right angles to each other. 

3. Prove that lx + my + nz = k, where l, m, n, k are constants, and 
Z 2 -f- m 2 -+• n 2 = 1 is the equation of a plane whose normal has the direction 
cosines Z, m, n, and whose shortest distance from the origin is k. 

4. Prove that A • (B X C) = B * (C X A) — C • (A X B), where A, B, C are 
any vectors. Show that these are equal to the determinant 



6. Prove that A X (B X C) — B(A • C) — C(A • B), where A, B, C are any 
vectors. 

6. Prove that div dE <= a div F + (F • grad a), where a is a scalar, F a vector. 

7. Prove that curl aF = a curl F -f- [(grad a) X F], where a is a scalar, 
F a vector. 

8 . Prove that div (F X G) = (G ■ curl F) — (F • curl G), where F, G are 
vectors. 

9. Prove that div curl F = 0, where F is any vector. 

10. Prove that curl curl F = grad div F — V 2 F, where F is any vector. 

11 . Prove that curl grad a — 0, where a is any scalar. 
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One of the complications of electromagnetic theory is the fact 
that a number of different systems of units are in common use. There 
are two so-called “absolute” cgs systems, both based on the centi¬ 
meter as unit of distance, gram as unit of mass, and second as unit 
of time. These are the electrostatic and electromagnetic units, respec¬ 
tively (often abbreviated esu and emu). There is a combination of 
these, in which certain quantities are denoted in the electrostatic and 
certain others in the electromagnetic units, which is called the “Gauss¬ 
ian,” or “mixed,” units. There are the so-called “practical units,” 
based largely on the electromagnetic units. There is the system used 
in this volume, the so-called mks system, in which, by using practical 
units for the electromagnetic quantities, but by using meters as units 
of distance, and kilograms as units of mass, we convert the practical 
system into an absolute system (that is, into a system in which various 
numerical constants can be set equal to unity, so that they drop out 
of the equations). Finally, in all these systems, we have a choice of 
so-called “rationalized” and “unrationalized” units, differing by 
factors of r, in which the unrationalized units give simpler formulas 
when we are dealing with problems of spherical symmetry, as the 
force between two point charges (where the force field is spherically 
symmetrical), while the rationalized units give simpler formulas when 
we are dealing with problems with rectangular symmetry, as the 
capacity of a parallel-plate condenser, or the properties of a plane wave. 
Of all these systems, the unrationalized Gaussian units, and the 
rationalized mks units, have the widest use and are the most valuable. 
In this appendix we take up the relations among these units, and the 
definitions of all the types of units. 

We start with the unrationalized electrostatic units. Here the 
fundamental definition is that of the unit charge. It is so chosen as to 
eliminate the constant in Coulomb's law, which we have stated in 
mks units in Eq. (2.2), Chap. I. That is, in esu, the force in dynes 
between two unit charges is given by 
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where r is in centimeters; or unit charge is that charge which, placed 
at a distance of 1 cm from an equal charge, acts on it with a force of 
1 dyne. The corresponding rationalized electrostatic units would be 
defined by the statement F = qq'/Anrr 2 , so that unit charge would 
be that which, placed at a distance of 1 cm from an equal charge, 
acts on it with a force of 1 /Air dynes. That is, the unit charge in rational¬ 
ized units is 1 /V^x times as great as in unrationalized units. We shall 
not, however, consider rationalized esu further. 

In unrationalized esu, we define unit current as that in which unit 
charge passes any point of the circuit per second. We define unit 
electric field E from the force equation, which we have given in 
Eq. (1.1), Chap. I, for mks units. That is, we assume that 

F = <zE, (2) 

for a static field, so that unit field is that in which unit charge is 
acted on by a force of 1 dyne. Electrostatic unit of potential is that 
potential difference which will impart to unit charge an energy of 
1 erg. It is clear from this that the field of a charge q, at a distance r, 
has a xnagnitude q/r 2 , and a scalar potential q/r, in unrationalized esu. 
The units of D, in unrationalized esu, are determined by the arbitrary 
assumption that D = E in empty space, in these units, so that the 
units of D are the same as those of E. If now we have a field 



from a charge q, the outward flux of D can be found at once by con¬ 
sidering a sphere, and is 47r q. Thus Gauss’s law becomes 



n da = 4tt 



47T 



and the corresponding differential expression is 


div E = 4-7rp. 


(3) 

(4) 


With the formulas relating charge and field, we can find the 
capacity of a condenser in esu, where of course capacity is defined 
as the charge per unit potential difference. For a parallel-plate con¬ 
denser of area A sq cm, with a separation of d cm between the plates, 
and empty space between, the capacity proves to be 

A 

4t rd 


C = 


(5) 
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We note that the capacity, being an area divided by a length, has the 
dimension of length in these units; it is sometimes expressed in centi¬ 
meters. We also note that the factor Anr, which did not appear in 
Coulomb's law (1) in unrationalized units, has appeared in the formula 
for the capacity of a parallel-plate condenser, just the reverse of the 
situation with rationalized units. If now the condenser is filled with a 
dielectric, the capacity will be k 6 times as great, where K e , the dielectric 
constant, being a dimensionless quantity, is the same in all systems of 
units. The situation is complicated by the fact, however, that in 
unrationalized esu the quantity k 6 is often denoted by e, though with a 
quite different meaning for e from what we have in our mks units. 
The capacity of a parallel-plate condenser filled with a dielectric, in 
unrationalized esu, is then 


C = 


k 6 A eA 

4a -d OT M 


( 6 ) 


It will be noted that this is related to the value eoA/d, given in Sec. 2, 
Chap. II, for the mks units, by having <= in place of « 0 (since the dielec¬ 
tric is not equivalent to free space), and by being divided by Air. 
Similarly any of the other expressions for capacities of condensers of 
various shape, given in that section, can be converted to unrationalized 
esu by changing e 0 to e, and by dividing by Air. 

A dielectric of course derives its properties from the dipoles within , 
it. The potential of a dipole of moment m (which, of course, is the 
product of the charge, in esu, times the separation of the opposite 
charges, in centimeters) is m cos 6/r 2 , instead of the value 

m cos 0 
Aireor 2 


which we have in mks units. We set up a polarization vector P, 
equal to the dipole moment per unit volume in a dielectric, just as we 
did in mks units. As before, we find the volume charge density arising 
from the polarization to be given by the relation div P — — p'. 
Assuming, as in Chap. TV, that E is the field arising from all charge, 
real and polarized, we have div E = Air(p + p') = 47rp — Air div P, 
div (E + 4 ttP) = 4irp. Thus, comparing with (4), we see that 


Kfl — 1 H- Jjj- = 1 + 47TXe, 


(7) 


where x*, the susceptibility, in the unrationalized units, is clearly 
1 /Air times as great as in rationalized units, as in Eq. (2.3), Chap. TV. 
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Taking account of the different definition of the dipole moment, we 
see that, in place of Eq. (1.3), Chap. IX, giving the dielectric constant 
of a material containing polarizable molecules, we have 



1 * 4 ~ Air 


? 


«2 


Nke^/m 
— a? 2 + joigk 


( 8 ) 


differing from the formula in mks units by a factor Arreo in the second 
term. 

We have now discussed most of the electrostatic relations in 
unrationalized esu. One additional quantity is the conductivity, 
which is defined by O hm ’s law J = <rE, in terms of the esu definitions 
of current density and electric field. When we come to magnetic 
quantities, it is less convenient to use esu. We can do so, by defining 
a magnetic field in terms of the force on a current element, but such 
magnetic units are seldom used, and we shall not discuss them here. 

We next consider the unrationalized electromagnetic units. These 
are based on the law of force between two current elements as a 
starting point. Just as the esu of charge is chosen so as to eliminate 
the constant in Coulomb's law, so the emu of current is chosen so as to 
e limina te the constant in Eq. (1.4) of Chap. Y, giving the force between 
two current elements. Thus in emu we have for that equation 


dF = id 2 


[dsi X (ds 2 X r)] 


(9) 


That is, unit current in the electromagnetic system is that current 
which, flowing in 1 cm length of wire, acts on a similar current 1 cm 
away, appropriately oriented, with a force of 1 dyne. Having defined 
unit current, we can define unit charge in the electromagnetic system 
as the charge crossing a given point per second, when unit current is 
flowing. We can define the emu of potential as the potential differ¬ 
ence that will impart to 1 emu of charge an energy of 1 erg. We thus 
note that we can get definitions of electrostatic quantities in the electro¬ 
magnetic system of units. It is then a question of experiment (involv¬ 
ing in principle the measurement of forces between charges, and forces 
between currents, and comparing the charges and currents) to find 
the relation between the electrostatic and electromagnetic units of 
charge, current, and potential difference. It is found that 


1 emu of charge = 2.998 X 10 10 esu of charge 
1 emu of current = 2.998 X 10 10 esu of current 
2.998 X 10 10 emu of potential = 1 esu of potential. 


(10) 
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The quantity 2.998 X 10 10 appearing in the ratio of units is that 
which proves to express the velocity of light in centimeters per second, 
and for most purposes it can be approximated by 3 X 10 10 cm/sec. 

The magnetic induction B is determined in the electromagnetic 
system in such a way that the force equation F = q(v X B) or force 
per unit volume = (J X B) will hold, without additional constants. 
Thus unit magnetic induction, in emu, is that induction in which a 
unit current flowing in a wire 1 cm long at right angles to the magnetic 
field is acted on by a force of 1 dyne. This emu of m agnetic induction 
is almost the only one of the absolute units that is in common use: 
it is the gauss. We then find that the induction arising from an 
element of current is 


dB = 


i ds X r 

l-i* * 


( 11 ) 


the appropriate form of the Biot-Savart law for emu. Using this 
law, we can find the value of B arising from different forms of stationary 
currents. Since (11) is related to Eq. (1.3) of Chap. V, the correspond¬ 
ing formula in mks units, by lacking the factor no/Air, it is clear that 
all the formulas for B in Secs. 2 and 3 of Chap: V can be converted to 
emu by multiplication by 47 t/mo. Thus the value of B at a distance R 
from an infinite straight wire carrying a current of i (in emu) is 2 i/R, 
and the value of B inside a solenoid with n turns per unit length is 
Ami. The solid angle £2 intercepted by a wire carrying a current of i 
emu can still be used as a scalar potential, and B is given by 


B — —i grad £1. (12) 

If we define the strength of a magnetic dipole as the current, in emu, 
times the area, the scalar potential, whose negative gradient gives B, 
is m cos 0/r 2 . The vector potential A still is related to B by the equa¬ 
tion B = curl A, where now A is determined by the equation 


A = 



(13) 


instead of by Eq. (0.4), Chap. V. 

From (12), we can see at once that Ampere's law for empty space is 


JB * ds 

or, in differential form, 

curl B 


4Si, 

(14) 

4ttJ. 

(15) 


In a magnetic medium, however, we have a magnetization current J' 
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arising from magnetic polarization. Defining magnetic moment as 
we have just done, we can set up a magnetization vector M as the 
total magnetic polarization per unit volume, and we find, as in Eq. 
(1.2), Chap. VI, that 

J' = curl M. (16) 

We then find, as in Chap. VI, that in Ampere’s law in a magnetic 
medium, we must include the effects of all currents, both real currents 
and those arising from polarization, so that (15) is replaced by 

curl B = Air(J + J') = 4ttJ + 4tt curl M, 
curl (B — 4 ttM) = 4?rJ. 

We then define a magnetic field H, by the equation 

H = B - 4ttM, 

in terms of which Ampere's law becomes 

curl H = 4irJ. (17) 

We see that this definition is so set up that B = H in a nonmagnetic 
medium. The relation between B and H may be expressed in the 
forms 

B = H + ArrM = K m H, Km — 1 + 4t rx m , where M = XmH. (18) 

These relations are clearly analogous to the corresponding electrostatic 
relations (7) and (8). 

Faraday’s law, in unrationalized emu, takes the forms 

y*E*ds= — ^ J B • n da, curl E = — —• (19) 

To see that these formulas, without additional constants, give the 
correct value, where E and B are expressed in emu, we may use the 
familiar derivation of Faraday’s law from the energy principle, a 
derivation that we did not give in the body of the text. In this 
derivation, the flux through a circuit is changed by deforming the 
circuit, keeping the value of B constant in time, rather than by chang¬ 
ing B with a constant circuit. The force exerted by the magnetic 
field on the element of length ds of wire carrying current i is t‘(ds X B), 
and the work done by this field, if the wire is displaced by an amount 
du, is ij(ds X B) • du = i/B • (du X ds). The quantity du X ds is a 
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vector equal in magnitude to the increment of area of the circuit, and 
pointing along the normal n to the surface spanning the circuit. Thus 
the work done, as computed above, may be rewritten id(fB • n da ). 
To do this external work, the current flowing in the wire must have 
lost a compensating amount of energy, which means that an emf 
JE • ds must have existed, satisfying the relation 

id(f B • n da) = —i dtfE • ds, 

from which Faraday's law follows at once. From Faraday's law, and 
the Biot-Savart law (11), we see at once that Eq. (2.2) of Chap. VII, 
for the coefficient of self- or mutual induction, is written in unrational¬ 
ized emu in the form 

L or M = j I (20) 

from which we see that the formulas of Chap. VII for inductances 
can be converted to emu by dividing by the factor ho/Att. 

We have now' seen how to express the electrical quantities in 
unrationalized esu, and the magnetic and electrical quantities in 
unrationalized emu. When we combine these, to get Maxwell's 
equations, it is customary not to use either of these systems of units 
consistently, but to combine them into the Gaussian units. In these 
units, we use emu for B and H, but esu for all other quantities, includ¬ 
ing current and charge. The electrostatic equations will then be as 
in esu, but the equations involving magnetic quantities will be changed. 
We have the relation (10) between esu and emu of charge, current, 
potential, and correspondingly of electric field. If we let 

c = 2.998 X 10 10 cm/sec, (21) 

we see that J, expressed in emu, is equal to 1 /c times J expressed 
in esu, while E expressed in emu is equal to c times E expressed in esu. 
Thus Maxwell’s equations in Gaussian units become 

curl E = — - div B = 0 

C dt 

curl H = - + 4 tt —j div D — 4 irp, (22) 

c at c 

where we have included the displacement-current term, whose correct¬ 
ness may be inferred from the fact that it satisfies the equation of 
continuity, div J + dp/dt = 0. These equations are to be supple- 
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mented by the constitutive equations, 

D = k c E, B = kJcL, (23) 

where k 9 and K m are as given in (7), (8), and (18), and where in Gaussian 
units they are often denoted by € and ju, respectively. Ohm's law is 
still stated in the form J = <rE, where the value of the conductivity <r 
is the same in Gaussian units that it is in esu. 

From Maxwell’s equations in Gaussian units, we can proceed 
to introduce the potentials, set up wave equations, and consider 
energy density and the Poynting vector. For the potentials we have 

B == curl A, E = — grad <p — - (24) 

c dt 

in which <p and A are subject to the relation 


div A + 


K-e^m d<p 
C dt 


(25) 


Then we find that the potentials satisfy the following equations, if 
K e and K m are constants independent of position: 


vv 


V 2 A 


K-eK-m d\ 

C 2 dt 2 

~dF 


= — 4wK*i- 


(26) 


S imil ar wave equations are satisfied by the components of electric 
and magnetic field. They show that, in a region containing no charge 
or current, waves will be propagated with a velocity c/V^Km, verify¬ 
ing our earlier statement that c represents the velocity of light in 
empty space. In a conducting medium in which J = <rE, and in 
which the charge density is zero, we have instead of (25) 


div A + + 


and instead of (26) 


vv 


47 TK m <T d<p K e Km 6*<p 

C 2 dt c 2 ~dt 2 


(27) 

(28) 


with a similar equation for A. 

When we consider plane wave solutions of the wave equation, 
we find that in empty space E (in esu) is numerically equal to H 
(in emu). In a medium in which K e and Km are different from unity, 
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\/lc e E is numerically equal to -v/kw H. In a conducting medium, 
propagating a wave with angular frequency co, k 0 is replaced by 

4-3 rj<r 

K r — - ) 

CO 

so that \/K e — Arcja/co E is equ al to \Z~icZ H, and the complex index of 
refraction, instead of being y/K e K m , equals y/tc m (K e — Afirja-jco ). From 
these formulas, the properties of reflection from conducting surfaces, 
skin depth, etc., can easily be found, but it should be remembered 
that in Gaussian units we must use <r in esu. 

From Maxwell’s equations we can prove Poynting’s theorem, which 
proves to be 

div [± (E X H)] + | [£(*»■ + *-**)] = -E . J. . (29) 

From this we infer that Poynting’s vector, in unrationalized Gaussian 
units, is (c/47r)(E X H), and the electric- and magnetic-energy densi¬ 
ties are k c E 2 /Stt and K m H 2 /S-7r, respectively. We note from our relation 
between the magnitudes of E and H, mentioned above, that the 
magnetic- and electric-energy densities in a plane wave are equal to 
each other. Applying this result to a spherical solution of the wave 
equation, we find that the rate of radiation from a dipole of amplitude 
M , angular frequency co, is 

J Sda = ( 3 °) 

analogous to Eq. (4.2), Chap. XII. Similarly the scattering cross 
section of a scattering electron, analogous to (5.3), Chap. XXI, is 

8ir m 4 ( e»V f o n 

3 («8 - O) 2 ) 2 + («fir) 2 \mcV ^ ’ 

We have now indicated the equivalent, in unrationalized Gaussian 
units, of many of the most important formulas that have been encoun¬ 
tered in mks units in the body of the text. Next we consider the 
practical units. These are based on the emu, but differ from them 
by arbitrary powers of 10, chosen to make the practical units of con¬ 
venient size. The two most fundamental units are the ampere and 
the volt. These are arbitrarily defined by the following equations: 

1 ampere — 10~ l emu of current 
1 volt = 10 s emu of potential. 


( 32 ) 
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It follows from this that in the practical system the rate of working, 
or power, when 1 amp flows through a potential difference of 1 volt, 
is 10 7 ergs/sec, or 1 joule/sec, or 1 watt, so that power in the practical 
system is measured in watts. The coulomb is of course defined in 
terms of the ampere, and the unit of electric field is ordinarily the 
volt per centimeter. Using (10), we see that 

1 esu of charge = 3 - 998 ^ 10 9 coulombs > 

1 esu of potential = 299.8 volts. (33) 

We see by dividing that 

1 ohm = 10 9 emu of resistance 

= (2 7 998) ^ X 10 11 eSU ° f resistance * (34) 

From Coulomb's law (1) we now see that the force between two 
charges q and q' } expressed in coulombs, at a distance of separation r, 
measured in centimeters, is F = (2.998) 2 X 10 1 B qq'/r 2 dynes or 
(2.998) 2 X 10 xz qq r Jr* newtons. If r is measured in meters, the force is 

F = (2.998) 2 X 10 9 newtons. (35) 

This is identical with Eq. (2.2) of Chap. I, F = qq'/Aire 0 r 2 , so that 
€0 = 1/[4tt X (2.998) 2 X 10 9 ] = 8.85 X 10~ 12 , as in Eq. (2.1) of Chap. 
I. We note that (2.2) of Chap. I, which is the basis of our treatment 
of electrostatics in rationalized mks units, nevertheless uses as a 
unit the coulomb, which is an unrationalized unit, being based on the 
unrationalized emu. The factor Air which appears in the definition 
of € 0 compensates for this. 

Similarly from the law (9) of force between two current elements, 
we see that the force between lengths ds\ and ds% of wire carrying 
currents of ix and ii, in amperes, at distance r, is 

dF = | ( r p Sa X f)] dynes 

= 10 r-HiU [ g? 1 X (dsg X r)] newtons . (36) 

These formulas hold whether distances are in centimeters or in meters, 
since the di m ensions of length cancel. Thus we verify Eq. (1.4) of 
Chap. V, in which we see that mo must be defined as in Eq. (1.2), 
Chap. V. When we come to magnetic fields, we find that the unit in 
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actual use is the gauss, which as we have seen is the emu of magnetic 
induction B. If then we have a current i, in amperes, flowing in a 
length ds of wire, in centimeters, in a magnetic induction B, in gausses, 
the force dF, in dynes, is 

dF = 10"“k*(ds X B) dynes. 

Similarly if ds is in meters, dF in newtons, we have 

dF = 10~H’(ds X B) newtons. 

It is evident that to eliminate the numerical factor 10 -4 in the mks 
system we must use a unit of B 10 4 times as large as the gauss. This 
unit is the weber per square meter. 

The weber is a unit of flux, not of magnetic induction. It is a 
practical unit, based on Faraday’s law: a time rate of change of flux 
of 1 weber/sec through a circuit induces an emf of 1 volt in that 
circuit. We recall, however, that a time rate of change of flux of 
1 gauss/sec through an area of 1 sq cm induces an emf of 1 emu. 
Since 1 volt = 10 8 emu, this means that it requires a change of flux 
of 10 8 gausses per second flowing through an area of 1 sq cm, or of 
10 4, gausses/sec flowing through 1 sq m, to induce 1 volt. In other 
words, 

1 weber/sq cm = 10 s gausses, 

1 weber/sq m = 10 4 gausses. (37) 

Thus we establish the relation between the weber and the gauss. 

We have now discussed the practical units of electrical and mag¬ 
netic quantities. At the same time, we have established the laws of 
force between charges and currents, and the other fundamental 
relations on which our treatment of the mks units in the text is based. 
We have not taken up the lucky accident by which the powers of 
10 in terms of which the practical units of current and voltage are 
related to the emu, and the powers of 10 by which the meter and the 
newton arc related to the centimeter and the dyne, are just such 
that they can be combined into a consistent set of units involving no 
powers of 10 in the final equations. This is in fact an accident, but 
when advantage is taken of it, we are led to the set of units taken 
up in the main discussion in the text. This set of units is still unfamil¬ 
iar to many readers, who were brought up to use the Gaussian units. 
The writers believe firmly that the mks units are to be preferred 
to the Gaussian units, not only for practical calculations, but for all 
purposes. The equations expressed in terms of mks units are certainly 
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no more complicated, and in many ways they are simpler, than those 
expressed in terms of Gaussian units. To that is added the great 
advantage that, whenever results are to be translated into practice, 
the units to be used in the mks system are those in practical use, 
whereas in the Gaussian system they are all unfamiliar, and a con¬ 
version table must be used. The only constants that need be remem¬ 
bered in the mks system are «o and jx o, and these have definite physical 
meanings: €o measures the capacity of a condenser whose plates are 
each 1 m square, and 1 m apart; jx 0 measures the self-inductance, per 
meter length, of a transmission line consisting of two parallel-plane 
conductors, each 1 m broad, spaced 1 m apart. (Each of these defini¬ 
tions, of course, implies suitable conductors outside those in question, 
to prevent fringing of fields.) 



APPENDIX III 
FOURIER SERIES 


Fourier’s theorem may be stated as follows: Given an arbitrary 
function <p(x). Then [unless <p(x) contains an infinite number of dis- . 
continuities in a finite range, or similarly misbehaves itself], we can 
write 


where 


#>(*) = 


A 0 , V (a 

t + 4 \ An 008 ir 

n» 1 


2 [ x/2 

A n = ^ <p(x) cos 

A.J -X/2 

2 f x/2 

B n = v I sin 

J -X/2 


+ sin 



2ima; 

~X~ 

2rnx 

~X~ 


dx, 

■ dx. 


( 1 ) 


( 2 ) 


This equation holds for values of x between —X/2 and X/2, but not 
in general outside this range. The series of sines and cosines is 
called “ Fourier’s series.” There are two sides to the proof of Fourier’s 
theorem. First, we may prove that, if a series of sines and cosines 
of this sort can represent the function, then it must have the coeffi¬ 
cients we have given. This is simple, and we shall carry it through. 
But second, we could show that the series we so set up actually repre¬ 
sents the function. That is, we should investigate the convergence 
of the series, show that it does converge, and that its sum is the 
function <p(x). This second part we shall omit, merely stating the 
results of the discussion. 

Let us suppose that <p(x) is represented by a series as in (1), and 
ask what values the A’s and B’s must have if the equation is to be true. 
Multiply both sides of the equation by cos (27rmx/X), where m is an 
integer, and integrate from —X/2 to X/2. We then have 



<p(x) cos 


+ 



cos 



cos 


2 mux 
X 


„ . 2tt nx 
B n sin - cos 


2irmx\ 

) 


dx. 
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But we can easily show by direct integration that 


' %mx 2 icmx , „ 

cos — z==— cos — ^r— ax = 0 

■X/2 A A 


if n and m are integers, unless n — m, and that 


' . 2irnx 2irmx , ^ 

sin —cos —==— ax — 0 
-x /2 A A 


if n and m are integers. Thus all terms on the right of (3) are zero 
except one, for which n — m. The first term falls in with this rule, 
when we remember that cos 0=1. This one term then gives us 


•/: 


2 imx 


as we can readily show. Hence 


-l/: 


dx = An-^) 


2irnx 


/ N 1/1/ 

<p(x) cos —Y 
X/2 A 


In a similar way, multiplying by sin {2k mx/X), we can prove the 
formula for B n . 

We have thus shown that, if a function <p(x) is to be represented 
by a series (1), the coefficients must be given by (2). We shall next 
make a few remarks about the other part of the problem, the question 
whether the series so defined really converges to represent the func¬ 
tion tp{x). In the first place, the series cannot in general represent 
the function, except in the region between —X/2 and X/2. For the 
series is periodic, repeating itself in every period, whereas the func¬ 
tion in general is not. Only periodic functions of this period can 
be represented in all their range by Fourier series. If we try to 
represent a nonperiodic function, the representation will be correct 
within the range from —X/2 to X/2, but the same thing will auto¬ 
matically repeat itself outside the range. Incidentally, we can easily 
change the range in which the series represents the function. If we 
merely change the range of integration so as to be from x 0 to Xo + X, 
where xo is arbitrary, the series will represent the function within this 
range. The case we have used corresponds to xo = —X/2; another 
choice frequently made is x 0 = 0. 

We can also change the value of X, and thereby change the length 
of the range in which the series is correct. To represent a function 
through a large range of x , we may use a large value of X. In fact, 
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as X becomes infinite, the quantities %mx/X for successive n’s become 
arbitrarily close together, and the summations involved in (1) and (2) 
may be replaced by integrations. Thus, if we let %mx/X equal «, 
the interval dco between successive values of this quantity will be 
do 3 — 2tt/X. We may then replace (1) by the integration 


where 



/> 


cos cox B n sin cox) dco, 



JL 


L 

t. 


tp(£) cos col; 
<p(£) sin oof" d%. 


Here we have left out the term in Aq, which becomes negligible in the 
limit. Combining, we finally have 

(p{x) = - / da) <p{£) cos co(£ — x) d£. (4) 

This theorem expresses Fourier’s integral theorem, which as we see 
is merely the limiting form of Fourier’s series for infinite value of the 
period X. 

There is an alternative form of Fourier’s theorem (1) and (2), 
expressed in terms of exponential rather than trigonometric functions, 
which is simpler to write, and for many purposes is more convenient. 
To derive it, we express the sines and cosines in (1) and (2) in complex 
exponential form. Grouping together the terms in e i2xnx/x , and the 
terms in e ~* 2rnx/x , we find without trouble that an equivalent statement 
of the theorem is 

oo 

<p(x) = ^ Cn^ ixnx/X (5) 

ft M - OO 

where 

1 f x/2 

C n = v / <p(x)<r iiwn * /x dx. (6) 

A J -X/2 

The terms for equal positive and negative values of n combine to 
give the sines and cosines correctly, and the term for n — 0 gives 
the term in Ao in (1). Though the Cn s, from their definition, are 
complex, it is easily shown that, if <p(x) is real, C n and C_ n are complex 
conjugates, and the scries itself is real. We can easily write an expres- 
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sion of Fourier’s integral theorem, equivalent to (4), but in exponential 
language, similar to (5) and (6). 

Although the range within which a Fourier series converges to the 
value of the function it is supposed to represent is limited, as we have 
seen, to the value X, there is a compensation, in that within this range 
a Fourier series can be used to represent much worse curves than a 
power series. Thus a Fourier series can still converge, even though 
the function has a finite number of finite discontinuities. It can 
consist, for example, of one function in one part of the region, another 
in another (in this case, to carry out the integrations, we must break 
up the integral into separate integrals over these parts, and add them). 
The less serious the discontinuities, however, the better the converg¬ 
ence. Thus, if the function itself has discontinuities, the coefficients 
will fall off as 1/n; whereas, if only the first derivative has discon¬ 
tinuities, the coefficients will fall off as 1/n 2 , etc. Differentiating a 
function makes the convergence of a series worse, as we can see, for 
example, if a function is continuous but its first derivative is dis¬ 
continuous. Then the coefficients fall off as 1/n 2 , but if we differ¬ 
entiate, the coefficients of the resulting series will fall off as 1/n. 
There is an interesting point connected with the series for a discon¬ 
tinuous function. If the function jumps from one value u\ to another 
U 2 at a given value of x, then the series at this point converges to the 
mean value, (ui + n 2 )/2. 

Problems 

1. Expand in Fourier series the function that is equal to —x for x between 
—X/2 and 0, and equal to x for x between 0 and X/2. 

2. Expand in Fourier series the function that is equal to —1 for x between 
—X/2 and 0, and equal to 1 for x between 0 and X/2. See if this series can be 
found by differentiating the series of Prob. 1 term by term. Consider the con¬ 
vergence of these two series, with reference to their continuity. What happens 
if we try to differentiate again term by term? 

3 . Expand in Fourier series the function that is equal to x 2 for x between 
— X/2 and X/2. Compute the sum of the first four terms of this series, and see 
how good an approximation to the function you have. 

4 . Expand in Fourier series the function that is equal to zero except for x 
between —i/2 and i/2, where i <3C X, while in this region it equals unity. Discuss 
the behavior of this function in the limit as i becomes zero. 
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VECTOR OPERATIONS IN CURVILINEAR COORDINATES 


Let us assume three orthogonal coordinates q y , q 2 , qs, so that the 
three sets of coordinate surfaces, qi = constant, q 2 = constant, 
q 3 — constant, intersect at right angles, though in general the surfaces 
will be curved. Now let us move a distance ds y normal to a surface 
qi — constant. By doing so, q 2 and q 3 do not change, but we reach 
another surface on which q y has increased by dq 1} which in general is 
different from ds y . Thus, with polar coordinates, if the displacement 
is along the radius, so that r is changing, ds = dr; but if it is along a 
tangent to a circle, so that 0 is changing, ds = r dd. In general, we 
have 

dsi = hi dqi, ds 2 = h 2 dq 2 , ds 3 = h 3 dq 3 , (1) 

where in polar coordinates the h connected with r is unity, but that 
connected with 0 is r. The first step in setting up vector operations 
in any set of coordinates is to derive these h’ s, which can be done by 
elementary geometrical methods. Thus in cylindrical coordinates, 
where the coordinates are r, 0, z, we have ds y — dr, ds 2 = r dd, ds 3 = dz , 
so that hi — 1, h 2 = r, h 3 == 1. In spherical polar coordinates, r, 6, <p, 
we have ds y = dr, ds 2 = r dd, ds 3 = r sin 0 dtp, so that h y — 1, h 2 — r, 
h 3 = r sin 0. 

Gradient.—The component of the gradient of a scalar S in any 
direction is its rate of change in that direction. Thus the component 
in the direction 1 (normal to the surface q y — constant) is 

dS = (l\(dS\ 

dsi \hi) \d<7i/ 

with similar formulas for the other components. Thus in cylindrical 
coordinates wc have 

, e dS , q 1 dS , 0 dS 

grad r S = —- grach b = - —, grad* S = (2) 

and in spherical coordinates we have 
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Divergence. —Let us apply the divergence theorem to a small 
volume element dV = d$i ds 2 ds 3 , bounded by coordinate surfaces at 
qx, Qx + dqx, etc. If we have a vector A, with components Ai, A 2 , Az 
along the three curvilinear axes, the flux into the volume over the face 
at qi, whose area is ds 2 dsz, is (Ax ds 2 ds 3 ) qi , and the corresponding 
flux out over the opposite face is (Ax ds 2 ds 3 ) qi+ d qi , where we note 
that the area ds 2 ds 3 changes with qx as well as the flux density A 1 . 
Thus the flux out over these two faces is 


S d Id 

— (li ds 2 dsz) dqi — -r— (Axh 2 h s ) dqx dq 2 dq z = n~r (Axh 2 hz) dv. 
oq 1 oQ 1 oQi 

Proceeding similarly with the other pairs of faces, and setting the 
whole outward flux equal to div A dv, we have 

divA - To [4i {AMs) + w. {AMl) + i (4) 

Thus in cylindrical coordinates we have 


j- » 19/14.1 dAg , dA z 

div A = - — (rAr) H-— 4- -r 2 

r dr K J r dd dz 


(5) 


and in spherical coordinates 

d 1 vA = I 2 |(^) +r8ine ^ 


1 ^ (sin Q A e ) + 


1 dAy 
r sin 6 dip 


( 6 ) 


Laplacian.—Writing the Laplacian of a scalar S as div grad S, 
and placing Ax — gradi S, etc., in the expression for div A, we have 

V 2 S = — I" — ( ^ 2 ^ 3 ^ r d / hzhx dS \ . d / hxh 2 d/S\~| 

hxh 2 hz[_dqx\ hi dqx) dq 2 \ h 2 dq 2 ) dq 3 \ h z dqz )J ^ 

Thus in cylindrical coordinates we have 

r dr \ dr J r 2 
and in spherical coordinates 


d 2 S 
se 2 dz 2 


( 8 ) 




4~ 


r 2 sin 6 dd 


( sin e H) 


+ 


1 <ps 

r 2 sin 2 6 dtp 2 


(9) 


Curl.—We apply Stokes’s theorem to an approximately rectangular 
area bounded by qx, qx + dqx, qz, qz + dq 2 . The line integral of a 
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vector A about the circuit is 

Aifai^a) d$i + ^2(^1 + dqi,qi) ds 2 — Ai(qi } q 2 + dqz) dsi — A2(^1,£2) d$ 2 * 
This is approximately equal to 


— (^ 2 A*) — — (hiA{) dqi dq> 2 . 
Since this must be curb A dsi d$ 2 , we have 


( 10 ) 


“ rl ‘ 1 “ B 6 [ 5 (M,) “ 4 "* (Ml) ]' 


( 11 ) 


with similar expressions for the other components. Thus in cylindrical 
coordinates we have 

curl, A = - ~ — — 
r 86 8z 

, * dA r dA . 


curb A = ~ - ~ 2 
6s dr 

curl* A = -~ (rA e ) 

r dr v ’ r 86 


( 12 ) 


and in spherical coordinates 


curb A = - 4-~ ~ (sin 0 A v ) -— 

r sin 0 50 v r sm 6 dtp 

i a 1 dA r 1 d t . N 

r sin 6 dtp r dr v v/ 

1 a Id... 1 dA r 

v r dr v 7 r 50 


(13) 
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SPHERICAL HARMONICS 


In Chap. Ill, we considered the solution of Laplace’s equation in 
spherical coordinates, and showed that the solution may be written 
as a product of a function of r, a function of 9, and a function of <p. 
The function of <p is sin rrup or cos m<p, and the function 0 of 9 satisfies 
the differential equation 

>§) + [>“ + I> -*]<>-<>- <» 
We showed that the function © could be written in the form 

0 = sin” 9(A o 4 A 2 cos 2 0 4 A 4 cos 4 9 4- • • •) 
or 

= sin” 9{A 1 4 A z cos 3 6 + A$ cos 5 9 + • • •)> (2) 

where 

An _ _ (n 4- l)(w 4 2) _ 

A n +2 l{l 4 1) — (m 4 ri)(m 4 n 4 1) 

We found further that if l is an integer one or the other of the series 
in (2) breaks off to form a polynomial. On the other hand, we stated 
without proof that, if the series does not break off, it leads to a func¬ 
tion that becomes infinite when cos 9 = ± 1, so that such a function 
cannot be used for expanding solutions of Laplace’s equation that 
remain finite at cos 6 = ± 1. Thus it is only the polynomial solutions 
that are usually of use. 

These polynomials, expressed in a certain form, with suitably 
chosen values of the coefficients, are the associated Legendre poly¬ 
nomials. They are ordinarily expressed, not as in (2) in a series of 
increasing powers of cos 0, but in a series of descending powers, start¬ 
ing with the term of highest power. In this form, they are defined as 


p "( cos = m S1° % [ < c ° s - ~ 1} 

(l - m)(l - m - 1)(/ - m - 2 )(l - m - 3) , ny _ m _ l 

+ -2 • 4(2i — 1)(2Z — 3) - (oos ") “ ’ ' 


4- 


(— l) p (7 — m)(l — m — 1) 


(l — m — 2p 4 1) 


2-4 


(2p)(2 1 - 1)(2 1 - 3) 


(2 1 - 2p 4 1) 
4 


(cos d) l ~ m ~ 2p 
• • - ]. (4) 
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To show the identity of this expression with (2), we need merely show 
that the ratio of successive coefficients is as given in (3). To do this, 
we may identify the term in (cos e) l ~ m ~ 2p in (4) with the term in 
(cos 0) n in (2), and the term in (cos Qy-™-*p+* in (4) with that in 
(cos 0) n+2 in (2). Doing this, taking the ratio of coefficients from (4), 
we find easily that it is the same as given by (3), so that (4) forms a 
legitimate way of writing the solution (2). The constant factor 
multiplying the series in (4) is chosen to simplify certain formulas. 

For certain purposes it is more convenient to let cos 0 — x, and 
to express the associated Legendre polynomials in terms of x. The 
equation (1), when expressed in terms of x, is 

(1 - *!) H - 2 * g + [zcz + D - e = °- <*> 

The function (4) can be written in a form that is sometimes useful, 
and that can be justified directly from the differential equation (5). 
This is 

PTix) - ~ (1 - X s )” (x 2 - 1)'. (6) 


From the form (6), we can arrive at the expression (4) by expanding 
(x 2 — 1 ) l in binomial expansion, and carrying out the differentiation 
term by term. The Legendre functions are the special case of (4) or 
(6) for which m = 0. They are often denoted Pi (cos 0 ) or Pi(x). 

The associated Legendre polynomials have many important prop¬ 
erties, but for our present purposes the most useful ones are their 
orthogonality and normalization relations. It can be proved that 


P?(x)PZ(x) dx = 0 if l n 

_ _ 2 (l + m) ! . 

21 + 1 (Z - m )! 


(7) 


This relation can be used in the following way to satisfy boundary 
conditions over the surface of a sphere, in solutions of Laplace’s 
equation: As we see from Eq. (3.3), Chap. Ill, such a solution can be 
expressed as a function of the angle by 

^ ^ P? (cos 6)(A lm sin rrup + B im cos m<p). (8) 

Irn 

In this summation, l is to go from 0 to infinity, m from 0 to l (for from 
our definitions we see that ?n can never be greater than l). Let us 
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suppose that the sum (8) is to be a specified function of 0 and <p, which 
we may write V'oCcos 6,<p). Let us then multiply both sides of (8) by a 
particular function, say Pj(cos 0) sin n<p, and integrate over all solid 
angles (which amounts to integrating over the surface of the sphere). 
The element of solid angle is sin d dd d<p. If we let x = cos 0, we have 
dx = — sin 0 dO. Thus we have 

f-i f ^ s i n n< P fate#) - ^ Pf{x)Pl(x) dx 

r 2 tt 

/ sin sin cos mp) 

Because of the orthogonality of the sines and cosines, the integral 
over <p on the right is zero unless n = m, in which case it is irAim. 
Setting n = m, the integral over x on the right is zero, because of (7), 
unless l = p, in which case its value is given by (7). Thus we have 

2Tfl (I - m)\ Alm = /_! dx f 0 sin wMw) (9) 

In a similar way we can find a formula for Bi m . 



APPENDIX VI 
MULTIPOLES 

The properties of multipoles can be conveniently considered either 
in rectangular or in spherical coordinates. We shall give both types 
of discussion, later indicating briefly the relation between them. 
First we define the dipole and quadrupole moments of a distribution 
of charges. Suppose we have a collection of point charges, all located 
near the origin. Let the ith charge be and let it be located at a 
point with coordinates &, rn, ft. Then the dipole moment is a vector, 
which we may denote by p (1 \ with components 

Vx l) = ^ V *i 1> = ^ dVi, ^ e t -ft. (1) 

i i i 

It is easy to see that this is consistent with the definition of dipole 
moment given in the text, by considering for instance a dipole con¬ 
sisting of a charge +c at point d along the x axis, and charge — e at 
the origin; its dipole moment is then a vector along the x axis, of 
magnitude cd. Similarly the quadrupole moment is a tensor. (The 
reader unfamiliar with tensor notation will find a discussion in Appen¬ 
dix V of Mechanics, by ,J. C. Slater and N. H. Frank, McGraw-Hill 
Book Company, Inc., New York, 1947.) We may denote it by p (2) , 
and its components are 



In a similar way one can define an octopole moment p (3) , whose com¬ 
ponents are products of three displacements, and which therefore has 
three subscripts, etc. 

We encounter the various moments of the charge distribution both 
in finding the field of the distribution at distant points, and in finding 
its potential energy in an arbitrary potential field. The electrostatic 
potential ^ at point x , y , z , arising from the charge distribution, by 
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Coulomb's law, is 



t 


( 3 ) 


where p» — -y/(x — &) 2 + (y ~ 4- (z — ft) 2 - Assuming that x, y, 

z are larger than &, t] i} ft, we may expand l/p t -, in (3), in Taylor's series 
in 17 *, ft- In doing this, we must take the derivatives of pi with 
respect to £»•, rji, ft, and then set these quantities equal to zero. We . 
note, however, that the derivative of pi with respect to & is the nega-L 
tive of its derivative with respect to x , etc. We may then replace 
the derivatives ’ by the negative derivatives of r — \/ x 2 + y 2 + z 2 
with respect to x, y, z. Proceeding in this way, we find that 


* = 




47TC0 


V? - 


d(l/r) 


dx 


4- - 

i 2 rxx 


d 2 (l/r) 
dx 2 


l( D *(Vr) 

i . W) , WO 


+ vV 


dy 


+ V 


m*(l/r)l 

' ds J 


+ o PlS 


_|_ W2) ^ 2 (I/ r ) i ^(2) 
^ Vxv dx dy ^ Vvz 


dy 2 
d 2 (l/r) 
dy dz 


4- - z> (2) - 

-r 2 a;g2 


+ p! 


<» ^ 2 dA) , 

ZX *\ _ 4 


6z 6a; 


- (4) 


Here the first term represents the potential of the total charge, as 
if it were concentrated at the origin, the next three terms represent 
the potential of the dipole moment, the next of the quadrupole moment, 
etc. 

The expression (4) is easier to interpret if we note the form of the 
various derivatives. We have 


6(l/r) _ _ 6 2 (l/r) _ _ 1 3a; 2 6 2 (l/r) _ 3 xy 

dx , r 3 dx 2 r 3 r 6 ’ dx dy r B ’ 

with similar formulas for the other derivatives. Thus we see that 
the potential arising from the dipole moment is 

. = __i_ [pL 1 ^ + p£v + y^z\ 

y 47reo r 3 ‘ w 


In the numerator we have the scalar product of the dipole moment 
and the radius vector^ since this is the magnitude of the dipole moment, 
times r, times the cosine of the angle between, we verify Eq. (5.1) of 
Chap. Ill for the potential of a dipole. We note further that the 
potential arising from a dipole is proportional to 1/r 2 times a function 
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of the angle, and by examination of (4) and (5) we find similarly that 
the potential arising from a quadrupole is proportional to 1/r 3 times 
a function of angle, with correspondingly increasing exponents of the 
inverse powers for the higher multipoles. 

The type of expansion we have used can be employed to find the 
potential energy of a charge distribution in an external field. Let there 
be a potential function ip(x,y,z) arising from charges external to the 
distribution we are considering. Then the potential energy of the 
distribution in this external field is 


i 

Expanding by Taylor’s theorem, this becomes 

V = (X «) * (0) + P "’ to + P " t + p<;> to 

+ +10 + >9 Sti + 



(7) 


We note that the dipole terms in the potential energy form the scalar 
product of the dipole moment and the gradient of the potential, or 
the negative of the scalar product of dipole moment and electric 
field. By investigating the change of V, in (7), when the charge 
distribution is given a translation or a rotation, we can find the 
forces and torques acting on the charge distribution. We find, of 
course, that there is no net force acting on a dipole in a uniform field; 
the force arises only when the field strength varies with position. 
There is, however, clearly a torque acting on a dipole in a uniform 
field, tending to force the dipole into parallelism with the field. 

We shall now express the formulation of the potential arising from 
a charge distribution in spherical polar coordinates. Let the coordi¬ 
nates of the charge be r,, 0<, </>,-, and the coordinates of the point 
where we wish to find the potential r, 9 , <p. Then, writing x , y, z, 
£», Vi t in terms of spherical coordinates, we find that the distance 
between the point r, 6 , <p, and the point r,-, 0», <pi where the ith charge is 
located is 


p,- — a/ r* — 2/t,[cos 0 cos 0; + sin 0 sin 0t cos (<p — <pdl -t- r\. (8) 


We then substitute in (3) to get the potential. In deriving this, 
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we shall make use of a very important theorem relating to spherical 
harmonics, whose proof we shall not give. This is the following: 

oo 

w = X r^ 1 l_ P ' (cos *> p «(oos ft) 

iZo 

i 

+ 2 X {r + mj ! P "( cos 8)^7(cOS ft) cos m(<p - W )1 (9) 

if r > r*. This expansion suggests setting up the following quantities, 
which serve as the various components of multipole moments in 
spherical coordinates: 



\ 

1 eir\Pi {cos 9i) 


i 

p$l i = 2 

V (Z — m)\ 7rt . 

(l -\- m)\ e£7 ^™( cos d i) cos m<pi 

i 


Pm, 2 = 2 

d 

^ (7 4 _ m ) j cos fi) sm rrupi. 

(10) 


We shall point out the nature of these quantities later, and shall 
show that those for l = 1 describe the dipole moment, those for / = 2 
the quadruple moment, etc. In terms of them, the potential is then 


* = 4^To X^ { P »’ Pj(C0S 6) 


l 

-h X M.lPr(C0S 9) COS m<p + Pm,2pr( COS 9) sin m<p] 

77 % — 1 


( 11 ) 


Let us now examine the expression (11), to find its significance. 
The term in l = 0 is very simple. Remembering, as follows from 
Appendix V, that P 0 (cos 9) = 1, it is 



4x60 T 


expressing the potential of the total charge, as in the first term of (4). 
For the term in l = 1, we have Pi(cos 5) = cos 9, PJ(cos 9) = sin 9. 
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Then 



6ifi COS Si 





e,r< sin Si cos <pi = 








Thus these three quantities are the components of the dipole moment, 
as we have defined them previously. Substituting in (11), we find 
that the term for l = 1 reduces to exactly the expression (5), when 
we remember that x = r sin S cos <p, y = r sin S sin <p, z = r cos S. 
In a similar way we can identify the terms in l ~ 2 in (11) with the 
quadruple terms in the rectangular case, though this identification 
is not so simple as with the dipole, and involves a paradox, in that 
there are only five components of the form given in (10), whereas we 
found six components of the quadrupole moment in (2). The paradox 
becomes resolved when we write out the potential in detail in both 
coordinate systems. Proceeding in a similar way, we see that the 
term of (11) corresponding to each value of l expresses the potential 
arising from a particular multipole. 



APPENDIX VII 
BESSEL’S FUNCTIONS 


Bessel’s equation is 

1 d ( dZ\ (, m 2 \ _ n 

zdz\ e di)+ [}-*) Z = 0 ’ W 

and its solutions are Z = J m (z), Z = N m (z). By expanding Z in 
power series, we may show that 



(m + 1) 


,(!)“ + 


2 \(m + 2 ) 


i dr 


( 2 ) 


where the coefficient of the first term is chosen according to an arbi¬ 
trary convention. There is no similarly simple expansion for the 
Neumann function N m (z); it requires not only terms in positive and 
negative powers of z, but also logarithmic terms. For N 0 (z), the 
leading term for small values of z is the logarithmic term: 


lim N 0 (z) = - (In z — 0.11593). 

z—> 0 7T 

For the higher values of m, the term in inverse powers of z is the 
leading term for small z: 


lim N m (z) = - {m 1)1 m > 0. (3) 

For values of z large compared with m, the Bessel and Neumann func¬ 
tions have asymptotic expressions as follows: 

Urn JJz) = x fl cos (z - 2m + 1 tt\ 
z-+ oe \7TZ \ 4 / 

“n Nm(z) = sin (* - 2jl r 1 4 w 

An important relation connecting tiie Bessel and Neumann functions is 


N m ^i(z)J m (z) - N m (z)J m— 1 
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An important relation gives J m (z ) as an integral: 


Jm(z) — 



where j = \/ — 1. 


cos (mw) dw, 


In addition to these relations, there. are a group of important 
relations holding equally well for either the Bessel or Ne umann func¬ 
tions, and which we shall therefore state in terms of Z m (z), meaning 
by this either J m (z) or N m (z). For the Bessel functions, most of 
these relations can be proved without difficulty from the series repre¬ 
sentation. T- he relations are more general, however, in that they apply 
to the Neumann functions as well, and can be proved directly from the 
properties of the differential equation. They are the following: 


£m-l(z) + Z m+1 (z) = ~ Z m (z ) 

£ 

Z— tn (z) — (— l) m Z m (z), if m is integral 
^ Zjn(z) = 2 [Zm— l(z) J2 m 4-x(z)] 

[2”'Z™(*)] = Z”Z ra _!(2) 

[2r—Z,„(z)]-r"Z wl (j) 


I Z,(z) dz = -Z„(z) 
j zZ„(z) dz — zZi(z) 
j Zl(z)z dz = [Z%(z) + Z-f(s)] 

j Z;„(z)z dz = ^ [Z'i(z) — Z (z)Z m+ i(z)]. 


In Chap. XII, See. 2, we introduced spherical Bessel and Neumann 
functions, defined by the equations (2.1) of that chapter. We showed 
in Eq. (2.2) of that chapter that they can be expressed in terms of 
algebraic and trigonometric.', functions; we found their asymptotic 
behavior in (2.3), and t heir behavior for small z in (2.4), all of that 
chapter. These properties can bo easily derived from the relations 
we have already given. For the series representation, we must 
modify (2) for nonintegral values of m, replacing m! where it appears 
in that equation by the gamma function F(m + 1), where the gamma 
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function satisfies the functional relation 

T(m +1) = 

and where r(H) = Vr. With this modification, (2) and (3) lead 
to Eq. (2.4), Chap. XII. The asymptotic behavior for large z given 
in Eq. (2.3), Chap. XII, follows directly from (4). To prove the 
trigonometric nature of the functions, we can first fmd.jo(z) and n 0 (z) 
directly, showing that as given in (2.2), Chap. XII, they are given by 
(sin x)/x and (cos x)/x, respectively. To do this, we can substitute 
the corresponding functions J 1 / 2 ( 2 ) and iV 1 / 2 ( 2 ), defined from them by 
(2.1), Chap. XII, directly in Bessel’s equation (1), and show that it is 
satisfied. Then we can derive the functions of higher index from 

these by the relation 

| [r*ZM = -r»Z m+ i(z), 

which we have given above, leading immediately to the algebraic and 
trigonometric nature of the higher functions, as well as to the explicit 

values for the functions. 



SUGGESTED REFERENCES 


Electromagnetic theory is too large a subject to be treated com¬ 
pletely in a single volume, and we give a few references in the present 
section, which the student familiar with this book can refer to, without 
too great difficulty. In the first place, the reader of inadequate 
preparation may wish to review his elementary electromagnetism and 
optics, and can use, for example, Introduction to Electricity and Optics, 
by N.H. Frank (McGraw-Hill). 

For general mathematical training and background, the student 
will first want texts on advanced calculus, such as Treatise on Advanced 
Calculus, by P. Franklin (Wiley), or Advanced Calculus, by E.B. Wil¬ 
son (Ginn). More advanced texts on analysis will be helpful, such as 
Mathematical Analysis, by Goursat and Hedrick (Ginn), Partielle 
Diffcrentialgleichungen der Physik, by Riemann and Weber (Rosen¬ 
berg), or Higher Mathematics, by R.S. Burington and C.C. Torrance 
(McGraw-Hill). Several texts on mathematics for special purposes 
are valuable: Applied Mathematics for Engineers and Physicists, by 
L.A. Pipes (McGraw-Hill); The Mathematics of Physics and Chemistry, 
by H. Margenau and G.M. Murphy (Van Nostrand); and Mathe¬ 
matical Methods for Engineering, by T. von Kdrmdn and M.A. Biot 
(McGraw-Hill). 

Among more specialized mathematical texts in the fields of particu¬ 
lar importance to electromagnetism are Introduction to Higher Algebra, 
by M. Bocher (Macmillan); Ordinary Differential Equations, by E.L. 
Ince (Dover); Fourier Series and Spherical Harmonics, by W.E. 
Byerly (Ginn); Newtonian Potential Function, by B.O. Peirce (Ginn); 
Fourier Series and Boundary Value Problems, by R.V. Churchill 
(McGraw-Hill); Vector Analysis, by II.B. Phillips (Wiley); and Vector 
and Tensor Analysis, by ILV. Craig (McGraw-Hill). The standard 
volumes of tables, A Short Table of Integrals, by B.O. Peirce (Ginn) and 
Funktionenlafeln, by Jahnke and Km do (Dover), will be found invalu¬ 
able for detailed assistance in calculation. For definite integrals that 
are not given in these books, Nouvelles Tables d'Intdgrales Ddfinies, by 
Bierens de Haan (Stnchert), will be found a source of much information. 

A number of general texts on theoretical physics cover electro¬ 
magnetic theory among other topics. Among these we may mention 
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Introduction to Theoretical Physics, by L. Page (Van Nostrand); Theo¬ 
retical Physics, by G. Joos (Stechert); Introduction to Theoretical Physics, 
by A. Haas (Constable); Introduction to Mathematical Physics, by 
It.A. Houstoun (Longmans); and Principles of Mathematical Physics, 
by W.V. Houston (McGraw-Hill). Two longer treatises on theoretical 
physics, in several volumes, may also be mentioned: Introduction to 
Theoretical Physics, by M. Planck (Macmillan), an English translation 
of a well-known German text, and Einfuhrung in die theoretische 
Physik, by C. Schaefer (De Gruyter). The last two works go a good 
deal more into detail than is possible in the present book. 

Next we come to a number of references dealing with the various 
branches of electromagnetic theory. Among general references, two 
standard works are Classical Electricity and Magnetism, by Abraham 
and Becker (Blackie), and The Mathematical Theory of Electricity and 
Magnetism, by J.H. Jeans (Cambridge). Treatments on approxi¬ 
mately the scale of the present text are given in Principles of Electricity, 
by L. Page and N.I. Adams (Van Nostrand), Electric Oscillations and 
Electric Waves, by G.W. Pierce (McGraw-Hill), Principles of Elec¬ 
tricity and Electromagnetism, by G.P. Harnwell (McGraw-Hill), and 
Static and Dynamic Electricity, by W.R. Smythe (McGraw-Hill). 
More advanced points of view are presented in Electromagnetic Theory, 
by J.A. Stratton (McGraw-Hill), and Electromagnetic Waves, by S.A. 
Schelkunoff (Van Nostrand) 

A number of texts handle more specialized problems. In electron 
theory, and the theory of dielectrics and magnetic materials, there are 
Theory of Electrons, by H.A. Lorentz (B.G. Teubner); Theory of Elec¬ 
tric and Magnetic Susceptibilities, by J.H. Van Vleck (Oxford); Polar 
Molecules, by P. Debye (Chemical Catalog Co.); and Introduction to 
Ferromagnetism, by F. Bitter (McGraw-Hill). ■ The theory of wave 
guides and microwaves is handled in a number of recent texts, includ¬ 
ing Microwave Transmission, by J.C. Slater (McGraw-Hill); Fields 
and Waves in Modern Radio, by S. Ramo and J.R. Whinnery (Wiley); 
and Hyper and TJltrahigh Frequency Engineering, by R.I. Sarbacher 
and W.A. Edson (Wiley). The subject of optics is handled from the 
standpoint of electromagnetic theory in Lehrbuch der Optik, by K. 
For sterling (S. Hirzel), and Optik, by M. Born (Springer). Other 
standard treatments of optics are given in Fundamentals of Physical 
Optics, by F.A. Jenkins and H.E. White (McGraw-Hill), and in 
Physical Optics, by R.W. Wood (Macmillan). 
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Anomalous dispersion (see Dispersion) 
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Coefficients of Courier series, 30-31, 
164-165, 217-220 
Coherence of light, 161 165 
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Conduction of electricity in metals, 
electron theory of, 106, 11 1-1 12 
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111-112 

Conductor of electricity as an equi- 
potential, 15 
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Continuity, equation of, 84 
Continuity conditions, electromagnetic 
wave, 117, 137 


Convergence of Fourier series, 217-220 
Coordinates, curvilinear, vector opera¬ 
tions in, 221-223 
Cornu’s spiral, 185-186 
Coulomb, 10-12, 214 
Coulomb’s law, 2, 12, 158 
Cross section, scattering, 160 
Curl, in curvilinear coordinates, 222-223 
of electric field, 14, 79-80, 83 
of a vector, 14, 199 
Current, density of, 61 
displacement, 83-85 
surface, 67 

Curvilinear coordinates, vector opera¬ 
tions in, 221-223 
Cutoff in wave guide, 129, 146 
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D’Alembert’s equation, 88 
solution of, 169-173 
Demagnetizing factor, 73-75 
Density, charge, 23 
current, 61 

of energy in electromagnetic field, 
95-103, 213 
Diamagnetism, 65 
Dielectric constant, 43-44, 105-114 
relation of to polarization, 43-44 
Dielectrics, 41-52, 105-114 
types of, 41-42 
Diffraction, 175-192 
Dipole and dipole moment, electric, 
35-42, 227-231 
magnetic, 59, 70 
oscillating, 157-161 

Discontinuity in electromagnetic field, 
44-46, 71, 117, 137 

Dispersion, of electromagnetic waves 
in metals, 111-112 
electron theory of, 105 1 14 
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Displacement current, 83 -85, 211 
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Divergence, in curvilinear coordinates, 
222 

of electric field, 24 
of magnetic induction, 57 
of a vector, 24, 199 
Divergence theorem, 200—202 
Dominant mode, 130 
Double layer, 35-39, 171 

E 

t 

Einstein's theory of relation between 
energy and mass, 101 
Electric conductivity, 106, 111-112 
Electric displacement, 42—43, 206 
Electric field, 1—52, 206 
in a spherical cavity, 52 
Electric images, 16-17, 47-49 
Electromagnetic field, energy in, 95— 
103 

of oscillating dipoles, 157-161 
wave equation for, 91 
Electromagnetic induction, 78—83, 210— 
211 

Electromagnetic units, 208-211 
Electromagnetic waves, 90—95, 103 
in metals, 90—95, 103, 111—112 
polarization of, 124 
reflection and refraction of, 117—128 
spherical, 148—165 
Electromotive force, 79—81 
Electron, 11 

radius of, 101, 160 

Electron theory and dispersion, 105—114 
Electrostatic field, energy in, 95-103, 
213 

Electrostatic potential, 14-18, 86—88 
Electrostatic problems and potential 
theory, 11—52 

Electrostatic units, 205-208 
Ellipsoid, magnetized, 75-75 
Energy, electrical, 95-103, 213 
magnetic, 97-103, 213 
Equation of continuity, 84 
Equipotential surfaces, 15—17 

F 

Farad, 11 

Faraday’s induction law, 3, 78-83, 
210-211 


Fermi statistics, 114 
Ferromagnetism, 66—67, 72-77 
Field (see Electric field; Electromag¬ 
netic field; Electrostatic field; 
Magnetic field) 

Flux, magnetic, 79-80 
Force on charge and current, 10, 54—55, 
205-206 

Fourier series, 30-31, 164-165, 217-220 
Fraunhofer diffraction, 180-192 
Fresnel diffraction, 180-187 
Fresnel equations for reflection, 6, 
120-128 

Fresnel integrals, 185 
Fresnel zones, 173-178 

G 

Gauss, 54 

Gauss’s theorem, 3—4, 19-21, 34 
Gaussian units, 211—213 
Generalized coordinates, vector opera¬ 
tions in, 211-223 

Gradient, in curvilinear coordinates, 
221 

of a scalar, 14, 198—199 
Green’s distribution, 38-39 
Green’s method for differential equa¬ 
tions, 25-26, 38-39 
Green’s theorem, 25, 170 

H 

Henry, 54 

Hertz electric waves, 7, 93 
Hertz vector, 166 
Huygens’ principle, 5, 167-178 
Hysteresis, 73-77 

I 

Images, method of, 16-17, 47-49 
Impedance, wave, 95 
Index of refraction, 105-114 
Induction, electromagnetic, 78-83, 210- 
211 

Induction vector, magnetic, 54-77 
Intensity, of electric field, 10 
of magnetic field, 69 
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L 

Laplace’s equation, 4, 23—27, 29—40, 
44, 168-173 

Laplacian in curvilinear coordinates, 
31, 222 

Leakage, magnetic, 75—77 
Legendre polynomials, associated, 32— 
34, 152, 224-226 

Legendre’s equation, 32, 152, 224-226 
Lines of force, 15—17 
Lippman fringes, 132 
Lorenz-Lorentz law, 110 

M 

Magnetic field, 53-77, 209—210 
Magnetism, 53—77, 209-210 
Magnetization vector, 67-70, 209-210 
Magnetomotive force, 75-77 
Maxwell’s equations, 2-7, 43, 58, 85- 
86, 94-95, 148-149, 211 
Metals, classical electron theory of, 
111-112 

plane waves of light in, 90-95, 103, 
111-112 

reflection of light from, 125—128 
Method of images, 10-17, 47-49 
Modes in wave guide, 129 
Molecular refractivity, 116 
Multiple-valued potential, 02 
Multiples, 36, 227-231 
Mutual inductance, 80--82, 211 

N 

Neumann functions (.see Bessel's equa¬ 
tion ) 

Newton’s law of mot ion, 1 
Normal ineidonee, rollertion eoeineienf 
at, 119-120 

O 

Oersted, 3, 53 
Ohm, 214 

Ohm’s law, 15, 208 

in differential form, 85 
Open circuits, 83 85 
Optics, 90-128, 148-194 


Oscillating dipole, radiation from, 157— 
161 

Oscillator strength, 110 

P 

Parallel-plate condenser, 21—22, 206 
Paramagnetism, 65 
Permeability, magnetic, 69—70 
Permittivity, 43 

Phase change in total reflection, 124 
Plane waves, 90—95, 103 

reflection and refraction of, 117—128 
Poisson’s equation, 4, 23—27, 31, 63 
Polarizability, 37, 44 
Polarization, of dielectric, 42—46 
of light, 124 

Polynomials, Legendre, 32, 152, 224r-226 
Potential, electrostatic, 14—18, 86—88 
magnetostatic, 57—62 
retarded, 168-173 
scalar, 14-18, 86-88 
vector, 62-64, 82-83, 86-88 
Poynting’s vector, 99—103 
practical units for, 213—215 
Products of vectors, 196—198 
Progressive waves, 90-95, 103 
Propagation constant, 91-92, 126—127 

Q 

Quadrupoles, 227—231 
Quantum theory, 7—9, 112—114 

It 

Radiation, electromagnetic, 90—103, 
148-165 
Rayleigh, 130 
Rayleigh scattering, 160 
1 {.cflcrtion, 117-128 
Refraction, 117 128 
index of, 105 I 14 
Relativity, theory of, 9 
Relaxation time, 89 
Resistance, specific, 85, 111 112 

Resolving power, grating, 192 
of a lens, 189- 190 
Resonance scattering, 161 
Resonant cavity, 135, 145 116 
Retarded potent, ml, 168- 173 
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S 

Scalar potential, 14-18, 86-88 
Scalar product, 196-197 
Scattering of light, 154-156, 159-165 
Self-inductance, 80-83, 211 
Separation of variables, 29-35 
Skin depth, 126 
Snell’s law, 118-119 
Solenoid, energy in, 98 
magnetic field in, 61, 71 
Solids, dispersion in, 109-114 
Specific conductivity, 85, 208 
Specific inductive capacity, 43, 208 
Specific resistance, 85, 208 
Spectral analysis of a light wave, 164— 
165 

Sphere, conducting, 35 
dielectric, 49-50 
magnetized, 72-74 

Spherical electromagnetic waves, 148— 
161 

Spherical harmonics, 32-34, 152, 224— 
226 

associated, or associated Legendre 
polynomials, 32-34, 152, 224— 
226 

Standing waves, 142-145 
Stokes’s theorem, 80, 202-203 
Surface-charge density, 20-21 
Susceptibility, electric, 44, 207-208 
magnetic, 70 


T 

Thomson’s formula for scattering of 
light, 160-161 
Toroid, 71 

Total reflection, 123-124 
Transverse electric and magnetic 
modes, 136-142, 144-150 
Traveling waves, 90—95,103 

U 

Unit vectors, 195-196 
Units, electrical, 10-12, 205-216 

V 

Vector operations in generalized coordi¬ 
nates, 221-223 

Vector potential, 62-64, 82-83, 86-88 
Vectors, 10,195-204 
Velocity of light, 93 
Volt, 10-12, 213 

W 

Wave equation, 91,139-140 
Wave guide, 129-146 
Waves, electromagnetic (see Electro¬ 
magnetic waves) 

Weber, 10, 54, 215 
Work, 14 

X 

X rays, 106,160-161, 163 




